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Customer No. 22,852 
Attorney Docket No. 01975.0034 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Application of: 
Willy DELEERSNIJDER et al. 
Serial No.: Not Yet Assigned 
Filed: March 22, 2002 



For: HUMAN G-PROTEIN 

COUPLED RECEPTOR 



U.S. NATIONAL PHASE OF PCT/EPOO/09584 

Assistant Commissioner for Patents 
Washington, DC 20231 

BOX: PCT 

Sir: 



Group Art Unit: 



Examiner: 



PRELIMINARY AMENDMENT 

Prior to examination, please amend the above-identified application as follows: 



IN THE SPECIFICATION: 

Please amend the specification as follows: 

Page 1 , after the title, insert the following new heading and paragraph: 
CROSS REFERENCE TO RELATED APPLICATIONS 

The application claims the benefit of U.S. Provisional Application No. 60/222,047, 
filed July 31, 2000, the content of which is incorporated herein by reference, and claims 
the right to priority based on European Patent Application No. 99203140.1, filed 
September 24, 1999, Netherlands Patent Application No. 1013140, filed September 24, 
1999, and European Patent Application No. 00202683.9, filed July 28, 2000. 




PATENT 

Attorney Docket No. 01975.0034 
CUSTOMER NUMBER 22,852 



REMARKS 



The examiner is respectfully requested to consider the above preliminary 
amendment prior to examination of the application. No new matter has been introduced 
by these amendments. 

If there are any fees due in connection with the filing of this amendment, please 
charge the fees to Deposit Account No. 06-0916. If a fee is required for an extension of 
time under 37 C.F.R. § 1.136 not accounted for above, such an extension is requested 
and the fee should also be charged to our deposit account. 
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DUNNER LLP 

1 300 I Street, NW 
Washington, DC 20005 
202.408.4000 
Fax 202.408.4400 
www.finnegan.com 



Respectfully submitted, 



FINNEGAN, HENDERSON, FARABOW, 
GARRETT & DUNNER, L.L.P. 



Dated: Marcti 22, 2002 
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Attorney Docket No. 01975.0034 
Customer Number 22,852 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re U.S. National Phase Application of: 
PCT/EPOO/09584 

Inventors: Willy DELEERSNIJDER et al. 

Serial No.: 10/088,744 

Filed: March 22, 2002 

For: HUMAN G-PROTEIN COUPLED 
RECEPTOR 



Assistant Commissioner for Patents 
Washington, DC 20231 



Group Art Unit: 



Examiner: 



Sir: 



STATEMENT 

The information recorded in computer readable form (attached diskette) is 



identical to the written sequence listing. 



Respectfully submitted, 

FINNEGAN, HENDERSON. FARABOW, 
GARRETT & DUNNER, L.L.P. 



Dated: July 22, 2002 
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Novel human G-protein coupled receptor 
Description 

5 The present invention relates to novel identified polynucleotides, polypeptides encoded by them 

and to the use of such polynucleotides and polypeptides, and to their production. More particularly, the 
polynucleotides and polypeptides of the present Invention relate to a G-proteIn coupled receptor 
(GPCR), hereinafter referred to as IGS4, IGS4 exists In two polymorphic forms, hereinafter referred to 
as IGS4A and IGS4B. The invention also relates to inhibiting or activating the action of such 
10 polynucleotides and polypeptides, to a vector containing said polynucleotides, a host cell containing 
such vector and transgenic animals where the IGS4-gene is either overexpressed, misexpressed, 
underexpressed and/or suppressed (knock-out animals). The invention further relates to a method for 
screening compounds capable to act as an agonist or an antagonist of said G-protein coupled receptor 
IGS4, and to the cognate ligand of IGS4. 



BACKGROUND OF THE INVENTION 



It is well established that many medically significant biological processes are mediated by proteins 

participating in signal transduction pathways that involve G-proteins and/or second messengers; e.g., 
20 cAMP (Lefkowltz, Nature. 1991. 351:353-354). Herein these proteins are referred to as proteins 

participating in pathways with G-proteins. Some examples of these proteins include the GPC receptors; 

such as those for adrenergic agents and dopamine (Kobilka, B.K.. et al., Proc. Natl. Acad. Sci., USA. 

1987, 84:46-50; Kobilka, B.K., et al., Science. 1987, 238:650-656; Bunzow, J.R., et al.. Nature, 1988. 

336:783-787), G-proteins themselves, effector proteins, e.g., phospholipase C, adenylate cyclase, and 
25 phosphodiesterase, and actuator proteins, e.g., protein kinase A and protein kinase C (Simon. M.I., et 

al., Science. 1991. 252:802-8). 

For example, in one fonm of signal transduction, upon hormone binding to a GPCR the receptor 
interacts with the heterotrimeric G-protein and induces the dissociation of GDP from the guanine 

30 nucleotide-binding site. At normal cellular concentrations of guanine nucleotides, GTP fills the site 
immediately. Binding of GTP to the a subunit of the G-protein causes the dissociation of the G-protein 
from the receptor and the dissociation of the G-protein Into a and py subunits. The GTP-canrying form 
then binds to activated adenylate cyclase. Hydrolysis of GTP to GDP, catalyzed by the G-protein itself 
(a subunit possesses an intrinsic GTPase activity), returns the G-protein to its basal, inactive form. The 

35 GTPase activity of the a subunit is. in essence, an internal clock that controls an on/off switch. The GDP 
bound form of the a subunit has high affinity for py and subsequent reassociation of aGDP with Py 
retums the system to the basal state. Thus the G-protein serves a dual role, as an intermediate that 
relays the signal from receptor to effector (in this example adenylate cyclase), and as a clock that 
controls the duration of the signal. 



CONFIRMATION COPY 
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The membrane bound superfamily of G-protein coupled receptors has been characterized as 
having seven putative transmembrane domains. The domains are believed to represent transmembrane 
a-helices connected by extracellular or cytoplasmic loops. G-protein coupled receptors include a wide 
5 range of biologically active receptors, such as hormone, viral, growth factor and neuroreceptors. 

The G'protein coupled receptor family includes dopamine receptors which bind to neuroleptic 
drugs used for treating CNS disorders. Other examples of members of this family Include, but are not 
limited to calcitonin, adrenergic, neuropeptideY, somastotatin. neurotensin, neurokinin, capsaicin, VIP, 
10 CGRP, CRF, CCK. bradyl<inin. galanin, motilin, nociceptin, endothelin, cAMP. adenosine, muscarinic, 
acetylcholine, serotonin, histamine, thrombin, kinin, follicle stimulating hormone, opsin, endothelial 
differentiation gene-1, rhodopsin, odorant. and cytomegalovirus receptors. 

Most G-protein coupled receptors have single conserved cysteine residues in each of the first two 
15 extracellular loops which form disulfide bonds that are believed to stabilize functional protein structures. 
The 7 transmembrane regions are designated as TM1, TM2, TM3, TM4. TM5, TM6 and TM7. The 
cytoplasmic loop which connects TM5 and TM6 may be a major component of the G-protein binding 
domain. 

20 Most G-protein coupled receptors contain potential phosphorylation sites within the third 

cytoplasmic loop and/or the carboxy terminus. For several G-protein coupled receptors, such as the p- 
adrenoreceptor, phosphorylation by protein kinase A and/or specific receptor kinases mediates receptor 
desensitization. 

25 Recently, it was discovered that certain GPCRs. like the calcitonln-receptor like receptor, might 

interact with small single pass membrane proteins called receptor activity modifying proteins (RAMP's). 
This interaction of the GPCR with a certain RAMP is determining which natural ligands have relevant 
affinity for the GPCR-RAMP combination and regulate the functional signaling activity of the complex 
(McLathie, L.M. et al., Nature (1998) 393:333-339). 

30 

For some receptors, the ligand binding sites of G-protein coupled receptors are believed to 
comprise hydrophilic sockets formed by several G-protein coupled receptor transmembrane domains, 
said sockets being surrounded by hydrophobic residues of the G-protein coupled receptors. The 
hydrophilic side of each G-protein coupled receptor transmembrane helix is postulated to face inward 
35 and form a polar ligand-binding site. TM3 has been implicated in several G-protein coupled receptors as 
having a ligand-binding site, such as the TM3 aspartate residue. TM5 serines, a TM6 asparagine and 
TM6 or TM7 phenylalanines or tyrosines are also implicated in ligand binding. 
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G-protein coupled receptors can be intracellularly coupled by heterotrimeric G-proteins to various 
Intracellular enzymes, ion channels and transporters (see. Johnson et ah, Endoc. Rev., 1989, 10:317- 
331). Different G-proteIn a-subunits preferentially stimulate particular effectors to modulate various 
biological functions in a cell. Phosphorylation of cytoplasmic residues of G-protein coupled receptors 
has been identified as an important mechanism for the regulation of G-proteIn coupling of some G- 
protein coupled receptors. G-protein coupled receptors are found in numerous sites within a mammalian 
host. 

Receptors - primarily the GPGR class - have led to more than half of the currently known drugs 
(Drews. Nature Biotechnology. 1996. 14: 1516). This indicates that these receptors have an established, 
proven history as therapeutic targets. Clearly there is a need for identification and characterization of 
further receptors which can play a role in preventing, ameliorating or correcting dysfunctions or 
diseases, including, but not limited to PNS, psychiatric and CNS disorders, including schizophrenia, 
episodic paroxysmal anxiety (EPA) disorders such as obsessive compulsive disorder (OCD), post 
traumatic stress disorder (PTSD), phobia and panic, major depressive disorder, bipolar disorder. 
Parkinson's disease, general anxiety disorder, autism, delirium, multiple sclerosis, Alzheimer 
disease/dementia and other neurodegenerative diseases, severe mental retardation, dyskinesias, 
Huntington's disease, Tourett's syndrome, tics, tremor, dystonia, spasms, anorexia, bulimia, stroke, 
addiction/dependency/craving, sleep disorder, epilepsy, migraine; attention deficit/hyperactivlty disorder 
(ADHD): cardiovascular diseases, including heart feilure, angina pectoris, arrhythmias, myocardial 
infarction, cardiac hypertrophy, hypotension, hypertension - e.g. essential hypertension, renal 
hypertension, or pulmonary hypertension, thrombosis, arteriosclerosis, cerebral vasospasm, 
subarachnoid hemorrhage, cerebral ischemia, cerebral infarction, peripheral vascular disease, 
Raynaud's disease, kidney disease - e.g. renal failure; dyslipidemias; obesity; emesis; gastrointestinal 
disorders, including irritable bowel syndrome (IBS), inflammatory bowel disease (IBD), gastroesophagal 
reflux disease (GERD), motility disorders and conditions of delayed gastric emptying, such as post 
operative or diabetic gastroparesis, and diabetes, ulcers - e.g. gastric ulcer; diarrhoea; other diseases 
including osteoporosis; inflammations; infections such as bacterial, fungal, protozoan and viral 
infections, particularly infections caused by HIV-1 or HI\/'2; pain; cancers; chemotherapy induced injury; 
tumor invasion; immune disorders; urinary retention; asthma; allergies; arthritis; benign prostatic 
hypertrophy; endotoxin shock; sepsis; complication of diabetes mellitus; and gynaecological disorders. 

SUMMARY OF THE INVENTION 

In one aspect, the invention relates to IGS4 polypeptides (including the IGS4A and IGS4B 
polypeptide polymorphs), polynucleotides and recombinant materials and methods for their production. 
Another aspect of the invention relates to methods for using such IGS4 polypeptides, polynucleotides 
and recombinant materials. Such uses include, but are not limited to, use as a therapeutic target and for 
the treatment of PNS. psychiatric and CNS disorders, including schizophrenia, episodic paroxysmal 
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anxlety (EPA) disorders such as obsessive compulsive disorder (OCD), post traumatic stress disorder 
(PTSD). phobia and panic, major depressive disorder, bipolar disorder, Parl<inson's disease, general 
anxiety disorder, autism, delirium, multiple sclerosis, Alzheimer disease/dementia and other 
neurodegenerative diseases, severe mental retardation, dyskinesias. Huntington's disease, Tourett's 
5 syndrome, tics, tremor, dystonia, spasms, anorexia, bulimia, stroke, addiction/dependency/craving, 
sleep disorder, epilepsy, migraine; attention deflcit/hyperactivity disorder (ADHD); cardiovascular 
diseases, including heart failure, angina pectoris, arrhythmias, myocardial infarction, cardiac 
hypertrophy, hypotension, hypertension - e.g. essential hypertension, renal hypertension, or pulmonary 
hypertension, thrombosis, arteriosclerosis, cerebral vasospasm, subarachnoid hemorrhage, cerebral 

10 ischemia, cerebral infarction, peripheral vascular disease, Raynaud's disease, kidney disease — e.g. 
renal failure; dyslipidemias; obesity; emesis; gastrointestinal disorders, including irritable bowel 
syndrome (IBS), inflammatory bowel disease (IBD), gastroesophagal reflux disease (GERD), motility 
disorders and conditions of delayed gastric emptying, such as post operative or diabetic gastroparesis, 
and diabetes, ulcers - e.g. gastric ulcer; diarrhoea; other diseases including osteoporosis: 

15 inflammations; infections such as bacterial, fungal, protozoan and viral infections, particularly infections 
caused by HIV-1 or HIV-2; pain; cancers; chemotherapy induced injury; tumor invasion; immune 
disorders; urinary retention; asthma; allergies; arthritis; benign prostatic hypertrophy; endotoxin shock; 
sepsis; complication of diabetes mellitus; and gynaecological disorders, among others. Preferred uses 
of the invention relate to disorders of the nervous system, including the central nervous system (CNS) 

20 and the peripheral nervous system (PNS), disorders of the gastrointestinal system and/or of the 
cardiovascular system and/or of skeletal muscle and/or of the thyroid, and/or also to lung diseases. 
Immunological diseases and disorders of the genitourinary system. 

In still another aspect, the invention relates to methods to identify agonists and antagonists using 
25 the materials provided by the invention, and treating conditions associated with IGS4 imbalance with the 
identified compounds. Yet another aspect of the Invention relates to diagnostic assays for detecting 
diseases associated with inappropriate IGS4 activity or levels. A further aspect of the invention relates to 
animal-based systems which act as models for disorders arising from aberrant expression or activity of 
IGS4. Preferred agonists or antagonists identified according to the present invention are those which are 
30 suited for the treatment of disorders of the nervous system, including the central nervous system (CNS) 
and the peripheral nervous system (PNS), disorders of the gastrointestinal system and/or of the 
cardiovascular system and/or of skeletal muscle and/or of the thyroid, and/or also to lung diseases, 
immunological diseases and disorders of the genitourinary system. 

35 The invention also relates to the Identification of the cognate ligand of the IGS4 polypeptides of 

the invention. High affinity binding to said IGS4 polypeptides is found for the neuropeptides known as 
neuromedin U. 
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Table 1 : iGS4A-DNA of SEQ ID NO: 1 and SEQ ID NO: 3 

5' - ' ~ ~ 

GGCTCAGCTTGAAACAGAGCCTCGTACCAGGGGAGGCTCAGGCCTTGGATTTTAATGTCA 
GGGATGGAAAAACTTCAGAATGCTTCCTGGATCTACCAGCAGAAACTAGAAGATCCATTC 
CAGAAACACCTGAACAGCACCGAGGAGTATCTGGCCTTCCTCTGCGGACCTCGGCGCAGC 
CACTTCTTCCTCCCCGTGTCTGTGGTGTATGTGCCAATTTTTGTGGTGGGGGTCATTGGC 
AATGTCCTGGTGTGCCTGGTGATTCTGCAGCACCAGGCTATGAAGACGCCCACCAACTAC 
TACCTCTTCAGCCTGGCGGTCTCTGACCTCCTGGTCCTGCTCCTTGGAATGCCCCTGGAG 
GTCTATGAGATGTGGCGCAACTACCCTTTCTTGTTCGGGCCCGTGGGCTGCTACTTCAAG 
ACGGCCCTCTTTGAGACCGTGTGCTTCGCCTCCATCCTCAGCATCACCACCGTCAGCGTG 
GAGCGCTACGTGGCCATCCTACACCCGTTCCGCGCCAAACTGCAGAGCACCCGGCGCCGG 
GCCCTCAGGATCCTCGGCATCGTCTGGGGCTTCTCCGTGCTCTTCTCCCTGCCCAACACC 
AGCATCCATGGCATCAAGTTCCACTACTTCCCCAATGGGTCCCTGGTCCCAGGTTCGGCC 
ACCTGTACGGTCATCAAGCCCATGTGGATCTACAATTTCATCATCCAGGTCACCTCCTTC 
CTATTCTACCTCCTCCCCATGACTGTCATCAGTGTCCTCTACTACCTCATGGCACTCAGA 
CTAAAGTU^GACAAATCTCTTGAGGCAGATGAAGGGAATGCAAATATTCAAAGACCCTGC 
AGAAAATCAGTCAACAAGATGCTGTTTGTCTTGGTCTTAGTGTTTGCTATCTGTTGGGCC 
CCGTTCCACATTGACCGACTCTTCTTCAGCTTTGTGGAGGAGTGGAGTGAATCCCTGGCT 
GCTGTGTTCAACCTCGTCCATGTGGTGTCAGGTGTCTTCTTCTACCTGAGCTCAGCTGTC 
AACCCCATTATCTATAACCTACTGTCTCGCCGCTTCCAGGCAGCATTCCAGAATGTGATC 
TCTTCTTTCCACAAACAGTGGCACTCCCAGCATGACCCACAGTTGCCACCTGCCCAGCGG 
AACATCTTCCTGACAGAATGCCACTTTGTGGAGCTGACCGAAGATATAGGTCCCCAATTC 
CCATGTCAGTCATCCATGCACAACTCTCACCTCCCAACAGCCCTCTCTAGTGAACAGATG 
TCAAGAACAAACTATCAAAGCTTCCACTTTAACAAAACCTGAATTCTTTCAGAGCTGACT 
CTCCTCTATGCCTCAAAACTTCAGAGAGGAACATCCCATAATGTATGCCTTCTCATATGA 
TATTAGAGAGGTAGAATGGCTCTTACAACTCATGTACCCATTGCTAGTTTTTTTTTTTTA 
ATAAACGTGAAAACTGAGAGTTAGATCTGGTTTCAAAACCCAAGACTGCCTGATTTTTAG 
TTATCTTTCCACTATCCTAACTGCCTCATGCCCCTTCACTAGTTCATGCCAAGAACGTGA 
CTGGAAAGGCATGGCACCTATACCTTGATTAATTTCCATTAATGGAAATGGTTCGTCCTG 
AGTCATCTACGTTCCGAGTCAGGCTGTCACTCCTACTA- 3 ' 
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Table 2: IGS4B-DNA of SEQ ID NO: 5 and SEQ ID NO: 7 

5' - " " ~~~~~ 

GGCTCAGCTTGAAACAGAGCCTCGTACCAGGGGAGGCTCAGGCCTTGGATTTTAATGTCA 
GGGATGGAAAAACTTCAGAATGCTTCCTGGATCTACCAGCAGAAACTAGAAGATCCATTC 
CAGAAACACCTGAACAGCACCGAGGAGTATCTGGCCTTCCTCTGCGGACCTCGGCGCAGC 
CACTTCTTCCTCCCCGTGTCTGTGGTGTATGTGCCAATTTTTGTGGTGGGGGTCATTGGC 
AATGTCCTGGTGTGCCTGGTGATTCTGCAGCACCAGGCTATGAAGACGCCCACCAACTAC 
TACCTCTTCAGCCTGGCGGTCTCTGACCTCCTGGTCCTGCTCCTTGGAATGCCCCTGGAG 
GTCTATGAGATGTGGCGCAACTACCCTTTCTTGTTCGGGCCCGTGGGCTGCTACTTCAAG 
ACGGCCCTCTTTGAGACCGTGTGCTTCGCCTCCATCCTCAGCATCACCACCGTCAGCGTG 
GAGCGCTACGTGGCCATCCTACACCCGTTCCGCGCCAAACTGCAGAGCACCCGGCGCCGG 
GCCCTCAGGATCCTCGGCATCGTCTGGGGCTTCTCCGTGCTCTTCTCCCTGCCCAACACC 
AGCATCCATGGCATCAAGTTCCACTACTTCCCCAATGGGTCCCTGGTCCCAGGTTCGGCC 
ACCTGTACGGTCATCAAGCCCATGTGGATCTACAATTTCATCATCCAGGTCACCTCCTTC 
CTATTCTACCTCCTCCCCATGACTGTCATCAGTGTCCTCTACTACCTCATGGCACTCAGA 
CTAAAGAAAGACAAATCTCTTGAGGCAGATGAAGGGAATGCAAATATTCAAAGACCCTGC 
AGAAAATCAGTCAACAAGATGCTGTTTGTCTTGGTCTTAGTGTTTGCTATCTGTTGGGCC 
CCGTTCCACATTGACCGACTCTTCTTCAGCTTTGTGGAGGAGTGGACTGAATCCCTGGCT 
GCTGTGTTCAACCTCGTCCATGTGGTGTCAGGTGTCTTATTCTACCTGAGCTCAGCTGTC 
AACCCCATTATCTATAACCTACTGTCTCGCCGCTTCCAGGCAGCATTCCAGAATGTGATC 
TCTTCTTTCCACAAACAGTGGCACTCCCAGCATGACCCACAGTTGCCACCTGCCCAGCGG 
AACATCTTCCTGACAGAATGCCACTTTGTGGAGCTGACCGAAGATATAGGTCCCCAATTC 
CTATGTCAGTCATCCGTGCACAACTCTCACCTCCCAACAGCCCTCTCTAGTGAACAGATG 
TCAAGAACAAACTATCAAAGCTTCCACTTTAACAAAACCTGAATTCTTTCAGAGCTGACT 
CTCCTCTATGCCTCAAAACTTCAGAGAGGAACATCCCATAATGTATGCCTTCTCATATGA 
AATTAGAGAGGTAGAATGGCTCTTACAACTCATGTACCCATTGCTAGTTTTTTTTTTTTA 
ATAAACGTGT^AAACTGAGAGTTAGATCTGGTTTCAAAACCCAAGACTGCCTGATTTTTAG 
TTATCTTTCCACTATCCTAACTGCCTCATGCCCCTTCACTAGTTCATGCCAAGAACGTGA 
CTGGAAAGGCATGGCACCTATACCTTGATTAATTTCCATTAATGGAAATGGTTCGTCCTG 
AGTCATCTACGTTCCGAGTCAGGCTGTCACTCCTACTA- 3 ' 
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Table 3: IGS4A-64-DNA of SEQ ID NO: 9 and SEQ ID NO: 11 



5' - 

GGCTCAGCTTGAAACAGAGCCTCGTACCAGGGGAGGCTCAGGCCTTGGATTTTAATGTCA 
GGGATGGAAAAACTTCAGAATGCTTCCTGGATCTACCAGCAGAAACTAGAAGATCCATTC 
CAGAAACACCTGAACAGCACCGAGGAGTATCTGGCCTTCCTCTGCGGACCTCGGCGCAGC 
CACTTCTTCCTCCCCGTGTCTGTGGTGTATGTGCCAATTTTTGTGGTGGGGGTCATTGGC 
AATGTCCTGGTGTGCCTGGTGATTCTGCAGCACCAGGCTATGAAGACGCCCACCAACTAC 
1 0 - TACCTCTTCAGCCTGGCGGTCTCTGACCTCCTGGTCCTGCTCCTTGGAATGCCCCTGGAG 
GTCTATGAGATGTGGCGCAACTACCCTTTCTTGTTCGGGCCCGTGGGCTGCTACTTCAAG 
ACGGCCCTCTTTGAGACCGTGTGCTTCGCCTCCATCCTCAGCATCACCACCGTCAGCGTG 
GAGCGCTACGTGGCCATCCTACACCCGTTCCGCGCCAAACTGCAGAGCACCCGGCGCCGG 
GCCCTCAGGATCCTCGGCATCGTCTGGGGCTTCTCCGTGCTCTTCTCCCTGCCCAACACC 
15 AGCATCCATGGCATCAAGTTCCACTACTTCCCCAATGGGTCCCTGGTCCCAGGTTCGGCC 
ACCTGTACGGTCATCAAGCCCATGTGGATCTACAATTTCATCATCCAGGTCACCTCCTTC 
CTATTCTACCTCCTCCCCATGACTGTCATCAGTGTCCTCTACTACCTCATGGCACTCAGA 
CTAAAGAAAGACAAATCTCTTGAGGCAGATGAAGGGAATGCAAATATTCAAAGACCCTGC 
AGAAAATCAGTCAACAAGATGCTGTCTTTGTGGAGGAGTGGAGTGAATCCCTGGCTGCTG 
20 TGTTCAACCTCGTCCATGTGGTGTCAGGTGTCTTCTTCTACCTGAGCTCAGCTGTCAACC 
CCATTATCTATAACCTACTGTCTCGCCGCTTCCAGGCAGCATTCCAGAATGTGATCTCTT 
CTTTCCACAAACAGTGGCACTCCCAGCATGACCCACAGTTGCCACCTGCCCAGCGGAACA 
TCTTCCTGACAGAATGCCACTTTGTGGAGCTGACCGAAGATATAGGTCCCCAATTCCCAT 
GTCAGTCATCCATGCACAACTCTCACCTCCCAACAGCCCTCTCTAGTGAACAGATGTCAA 
25 GAACAAACTATCAAAGCTTCCACTTTAACAAAACCTGAATTCTTTCAGAGCTGACTCTCC 
TCTATGCCTCAAAACTTCAGAGAGGAACATCCCATAATGTATGCCTTCTCATATGATATT 
AGAGAGGTAGAATGGCTCTTACAACTCATGTACCCATTGCTAGTTTTTTTTTTTTAATAA 
ACGTGAAAACTGAGAGTTAGATCTGGTTTCAAAACCCAAGACTGCCTGATTTTTAGTTAT 
CTTTCCACTATCCTAACTGCCTCATGCCCCTTCACTAGTTCATGCCAAGAACGTGACTGG 
30 AAAGGCATGGCACCTATACCTTGATTAATTTCCATTAATGGAAATGGTTCGTCCTGAGTC 
ATCTACGTTCCGAGTCAGGCTGTCACTCCTACTA- 3 ' 
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Table 4: IGS4A-protein of SEQ ID NO: 2 and SEQ ID NO: 4 (without the three amino acids 
between brackets). 



(MSG)MEKLQNASWIYQQKLEDPFQKHLNSTEEYLAFLCGPRRSHFFLPVSVVYVPIFWGV 
IGNVLVCLVILQHQAMKTPTNYYLFSLAVSDLLVIiLLGMPLEVYEMWRNYPFLFGPVGCY 
FKTALFETVCFAS ILS ITTVSVERYVAILHPFRAKLQSTRRRAIiRILGI VWGFSVLFSLP 
NTSIHGIKFHYFPNGSLVPGSATCTVIKPMWIYNFIIQVTSFLFYLIiPMTVISVLYYIiMA 
LRLKJODKSIiEADEGNANIQRPCRKSVNKMLFVLVLVFAICWAPFHIDRLFFSFVEEWSES 
L AAVFNLVHWSGVFF YLS S AVNP 1 1 YNLLSRRFQAAFQNVI S S FHKQWHSQHDPQLPPA 
QRNIFLTECHFVELTEDIGPQFPCQSSMHNSHLPTALSSEQMSRTNYQSFHFNKT 



15 



Table 5: iGS4B-protein of SEQ ID NO: 6 and SEQ ID NO: 8 (without the three annino 
acids between brackets). 



20 



(MSG) MEKLQNASWIYQQKLEDPFQKHLNSTEEYLAFIjCGPRRSHFFIjPVSVVYVPIFVVGV 

IGNVLVCliVILQHQAMKTPTlSnfYLFSLAVSDLLVLLIiGMPLEVYEMWRNYPFLFGPVGCY 

FKTALFETVCFASIIiSITTVSVERYVAILHPFRAKLQSTRRRALRIIiGIVWGFSVIjFSLP 

NTSIHGIKFHYFPNGSLVPGSATCTVIKPMWIYNFIIQVTSFIjFYLLPMTVISVIiYYLMA 

LRLKIODKSLEADEGNANIQRPCRKSVNKMLFVLVLVFAICWAPFHIDRDFFSFVEEWTES 

laavfnlvhwsgvlf yls s avnp 1 1 ynllsrrfqaafqnvi s s fhkqwhsqhdpqlppa 
qrnifltechfveltedigpqflcqssvhnshlptalsseqmsrtnyqsfhfnkt 



25 Table 6: IGS4A-^4-protein of SEQ ID NO: 10 and SEQ ID NO: 12 (without three amino acids 
between brackets). 



30 



{MSG ) meklqnaswi yqqkledpfqkhlnsteeylaflcgprrshfflpvswyvpifwgv 
ignvlvclvilqhqamktptnyylfslavsdllvlllgmplevyemwrnypflfgpvgcy 

FKTALFEWCFAS ILS ITTVSVERYVAILHPFRAKLQSTRRRALRILGI VWGFSVLFSLP 

ntsihgikfhyfpngslvpgsatctvikpmwiynfiiqvtsflfyllpmtvisvlyylma 
lrlkkdksleadegnaniqrpcrksvnkmlslwrsgvnpwllcstssmwcqvssst 
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DESCRIPTION OF THE INVENTION 



Definitions 

5 The following definitions are provided to facilitate understanding of certain terms used frequently 

herein. 

"IGS4" refers, among others, to a polypeptide comprising the amino acid sequence set forth in 
SEQ ID NO: 2 or SEQ ID NO: 4 (IGS4A) and SEQ ID NO: 6 or SEQ ID NO: 8 (IGS4B). or a variant 
10 thereof. Particularly preferred are polypeptides of IGS4B. 

"Receptor Activity" or "Biological Activity of the Receptor" refers to the metabolic or physiologic 
function of said IGS4 including similar activities or improved activities or these activities with decreased 
undesirable side effects. Also included are antigenic and immunogenic activities of said IGS4. 

15 

"IGS4-gene" refers to a polynucleotide comprising the nucleotide sequence set forth in SEQ ID 
NO: 1 , SEQ ID NO: 3. SEQ ID NO: 5 or SEQ ID NO: 7 or variants thereof and/or their complements. 

"Antibodies" as used herein includes polyclonal and monoclonal antibodies, chimeric, single 
20 chain, and humanized antibodies, as well as Fab fragments, including the products of a Fab or other 
immunoglobulin expression library. 

"Isolated" means altered "by the hand of man" from the natural state and/or separated from the 
natural environment. Thus, if an "isolated" composition or substance that occurs in nature has been 
25 "isolated", it has been changed or removed from its original environment, or both. For example, a 
polynucleotide or a polypeptide naturally present in a living animal is not "isolated", but the same 
polynucleotide or polypeptide separated from the coexisting materials of its natural state is "isolated", as 
the term is employed herein, 

30 "Polynucleotide" generally refers to any polyribonucleotide or polydeoxribonucleotide, which may 

be unmodified RNA or DNA or modified RNA or DNA. "Polynucleotides" include, without limitation 
single- and double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- 
and double-stranded RNA. and RNA that is a mixture of single-and double-stranded regions, hybrid 
molecules comprising DNA and RNA that may be single-stranded or, more typically, double-stranded or 

35 a mixture of single- and double-stranded regions. In addition, "polynucleotide" may also include triple- 
stranded regions comprising RNA or DNA or both RNA and DNA. The term polynucleotide also includes 
DNAs or RNAs containing one or more modified bases and DNAs or RNAs with backbones modified for 
stability or for other reasons. "Modified" bases include, for example, tritylated bases and unusual bases 
such as inosine. A variety of modifications has been made to DNA and RNA; thus, "polynucleotide" 
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embraces chemically, enzymatically or metabolically modified forms of polynucleotides as typically 
found in nature, as well as the chemical forms of DNA and RNA characteristic of viruses and cells. 
"Polynucleotide" also embraces relatively short polynucleotides, often referred to as oligonucleotides. 

5 "Polypeptide" refers to any peptide or protein comprising two or more amino acids joined to each 

other by peptide bonds or modified peptide bonds, i.e.. peptide isosteres. "Polypeptide" refers to short 
chains, commonly referred to as peptides, oligopeptides or oligomers, and to longer chains, generally 
referred to as proteins, and/or to combinations thereof. Polypeptides may contain amino acids other 
than the 20 gene-encoded amino acids. "Polypeptides" include amino acid sequences modified either by 

10 natural processes, such as posttranslational processing, or by chemical modification techniques which 
are well known in the art. Such modifications are well-described in basic texts and in more detailed 
monographs, as well as in a voluminous research literature. Modifications can occur anywhere in a 
polypeptide, including the peptide backbone, the amino acid side-chains and the amino or carboxyl 
termini. It will be appreciated that the same type of modification may be present in the same or varying 

15 degrees at several sites in a given polypeptide. Also, a given polypeptide may contain many types of 
modifications. Polypeptides may be branched as a result of ubiquitination. and they may be cyclic, with 
or without branching. Cyclic, branched and branched cyclic polypeptides may result from posttranslation 
natural processes or may be made by synthetic methods. Modifications include acetylation. acylation, 
ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, 

20 covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid 
derivative, covalent attachment of phosphotidylinositol; cross-linking, cyclization, disulfide bond 
formation, demethylation. formation of covalent cross-links, formation of cystine, formation of 
pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation, 
iodination, methylation. myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, 

25 racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins such 
as arginylation, and ubiquitination. See, for instance, PROTEINS - STRUCTURE AND MOLECULAR 
PROPERTIES. 2nd Ed., T. E. Creighton, W. H. Freeman and Company. New York, 1993 and Wold, F., 
Posttranslational Protein Modifications: Perspectives and Prospects. pgs, 1-12 in 
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic 

30 Press. New York, 1983; Seifter et a!., "Analysis for protein modifications and nonprotein cofactors", 
Meth. Enzymol. (1990) 182:626-646 and Rattan et al., "Protein Synthesis: Posttranslational 
Modifications and Aging", Ann. NY Acad. Sci. (1992) 663:48-62. 

"Variant" as the term is used herein, is a polynucleotide or polypeptide that differs from a 
35 reference polynucleotide or polypeptide respectively, but retains essential properties such as essential 
biological, structural, regulatory or biochemical propeties. A typical variant of a polynucleotide differs in 
nucleotide sequence from another, reference polynucleotide. Changes in the nucleotide sequence of the 
variant may or may not alter the amino acid sequence of a polypeptide encoded by the reference 
polynucleotide. Nucleotide changes may result in amino acid substitutions, additions, deletions, fusions 
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and truncations in the polypeptide encoded by the reference sequence, as discussed below. A typical 
variant of a polypeptide differs in amino acid sequence from another, reference polypeptide. Generally, 
differences are limited so that the sequences of the reference polypeptide and the variant are closely 
similar overall and, in many regions, identical. A variant and reference polypeptide may differ in amino 
5 acid sequence by one or more substitutions, additions, and deletions in any combination. A substituted 
or inserted amino acid residue may or may not be one encoded by the genetic code. A variant of a 
polynucleotide or polypeptide may be a naturally occurring such as an allelic variant, or it may be a 
variant that is not known to occur naturally. Non-naturally occurring variants of polynucleotides and 
polypeptides may be made by mutagenesis techniques or by direct synthesis. 

10 

"Identity" is a measure of the identity of nucleotide sequences or amino acid sequences. In 
general, the sequences are aligned so that the highest order match is obtained. "Identity" per se has an 
art>recognized meaning and can be calculated using published techniques. See. e.g.: 
(COMPUTATIONAL MOLECULAR BIOLOGY, Lesk. A.M.. ed., Oxford University Press. New York, 

15 1988; BIOCOMPUTING: INFORMATICS AND GENOME PROJECTS. Smith. D.W.. ed.; Academic 
Press. New Yori<, 1993; COMPUTER ANALYSIS OF SEQUENCE DATA, PART I, Griffin. A.M., and 
Griffin. H.G.. eds.. Humana Press. New Jersey. 1994; SEQUENCE ANALYSIS IN MOLECULAR 
BIOLOGY, von Heinje. G.. Academic Press. 1987; and SEQUENCE ANALYSIS PRIMER. Gribskov. M. 
and Devereux. J., eds., M Stockton Press, New York. 1991). While there exist a number of methods to 

20 measure identity between two polynucleotide or polypeptide sequences, the term "identity" Is well known 
to skilled artisans (Carillo, H., and Lipton. D., SIAM J. Applied Math. (1988) 48:1073). Methods 
commonly employed to determine identity or similarity between two sequences include, but are not 
limited to, those disclosed in Guide to Huge Computers, Martin J. Bishop, ed.. Academic Press. San 
Diego, 1994. and Carillo. H.. and Lipton. D.. SIAM J. Applied Math. (1988) 48:1073. Methods to 

25 determine identity and similarity are codified in computer programs. Preferred computer program 
methods to determine identity and similarity between two sequences Include, but are not limited to, GCG 
program package (Devereux, J., et al., Nucleic Acids Research (1984) 12(1):387). BLASTP, BLASTN, 
FASTA (Atschul. S.F. et al., J, Molec. Biol. (1990) 215:403). The word "homology" may substitute for the 
word "identity". 

30 

As an illustration, by a polynucleotide having a nucleotide sequence having at least, for example, 
95% "identity" to a reference nucleotide sequence of SEQ ID NO: 1 is intended that the nucleotide 
sequence of the polynucleotide is identical to the reference sequence except that the polynucleotide 
sequence may include up to five nucleotide differences per each 100 nucleotides of the reference 
35 nucleotide sequence of SEQ ID NO: 1. In other words, to obtain a polynucleotide having a nucleotide 
sequence at least 95% identical to a reference nucleotide sequence, up to any 5% of the nucleotides in 
the reference sequence may be deleted or substituted with another nucleotide, or a number of 
nucleotides up to any 5% of the total nucleotides in the reference sequence may be Inserted into the 
reference sequence, or in a number of nucleotides of up to any 5% of the total nucleotides in the 
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reference sequence there may be a combination of deletion, insertion and substitution. These mutations 
of the reference sequence may occur at the 5 or 3 terminal positions of the reference nucleotide 
sequence or anywhere between those terminal positions, interspersed either individually among 
nucleotides in the reference sequence or in one or more contiguous groups within the reference 
5 sequence. 

Similarly, by a polypeptide having an amino acid sequence having at least, for example, 95% 
"identity" to a reference amino acid sequence of SEQ ID NO: 2 is intended that the amino acid sequence 
of the polypeptide is identical to the reference sequence except that the polypeptide sequence may 

10 include up to five amino acid alterations per each 100 amino acids of the reference amino acid of SEQ 
ID NO: 2. In other words, to obtain a polypeptide having an amino acid sequence at least 95% identical 
to a reference amino acid sequence, up to any 5% of the amino acid residues in the reference sequence 
may be deleted or substituted with another amino acid, or a number of amino acids up to any 5% of the 
total amino acid residues in the reference sequence may be inserted into the reference sequence. 

1 5 These alterations of the reference sequence may occur at the amino or carboxy terminal positions of the 
reference amino acid sequence or anywhere between those terminal positions, interspersed either 
individually among residues in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

20 Polypeptides of the Invention 

In one aspect, the present invention relates to IGS4 polypeptides (or IGS4 proteins). The IGS4 
polypeptides include the polypeptide of SEQ ID NO: 2. SEQ ID NO: 4. SEQ ID NO: 6 and SEQ ID NO: 8 
and the polypeptide having the amino acid sequence encoded by the DNA insert contained in the 

25 deposit no. CBS1 02221 or deposit no. CBS1 02222, deposited on September 24, 1999 at the 
Centraalbureau voor Schimmelcultures at Baarn the Netherlands; as well as polypeptides comprising 
the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4. SEQ ID NO: 6 or SEQ ID NO: 8 and the 
polypeptide having the amino acid sequence encoded by the DNA insert contained In the deposit no. 
CBS1 02221 or deposit no. CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn the 

30 Netherlands and polypeptides comprising the amino acid sequence which have at least 80% identity to 
that of SEQ ID NO: 2, SEQ ID NO: 4. SEQ ID NO: 6 or SEQ ID NO: 8 and/or the polypeptide having the 
amino acid sequence encoded by the DNA insert contained in the deposit no, CBS1 02221 or deposit no. 
CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baam the Netherlands over its entire 
length, and still more preferably at least 90% identity, and even still more preferably at least 95% Identity 

35 to said amino acid sequences. Furthermore, those with at least 97%, in particular at least 99%, are 
highly preferred. Also included within IGS4 polypeptides are polypeptides having the amino acid 
sequence which has at least 80% identity to the polypeptide having the amino acid sequence of SEQ ID 
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8 or the polypeptide having the amino acid 
sequence encoded by the DNA insert contained in the deposit no. CBS1 02221 or deposit 
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no.CBSI 02222 at the Centraalbureau voor Schimmelcultures at Baarn the Netherlands over its entire 
length, and still more preferably at least 90% identity, and even still more preferably at least 95% identity 
to SEQ ID NO: 2. SEQ ID NO: 4. SEQ ID NO: 6 or SEQ ID NO: 8. Furthermore, those with at least 97%. 
in particular at least 99% are highly preferred. Preferably IGS4 polypeptides exhibit at least one 
5 biological activity of the receptor. 

In an additional embodiment of the invention, the IGS4 polypeptides may be a part of a larger 
protein such as a fusion protein. It is often advantageous to include an additional amino acid sequence 
which contains secretory or leader sequences, pro-sequences, sequences which aid in purification such 
10 as multiple histidine residues, sequences which aid in detection such as antigenic peptide tags (such as 
the haemaggiutinin (HA) tag), or an additional sequence for stability during recombinant production. 

Fragments of the IGS4 polypeptides are also included in the Invention. A fragment is a 
polypeptide having an amino acid sequence that is the same as part of, but not all of, the amino acid 

15 sequence of the aforementioned IGS4 polypeptides. As with IGS4 polypeptides, fragments may be 
"free-standing," or comprised within a larger polypeptide of which they form a part or region, most 
preferably as a single continuous region. Representative examples of polypeptide fragments of the 
invention, include, for example, fragments from about amino acid number 1-20; 21-40, 41-60, 61-80, 81- 
100; and 101 to the end of IGS4 polypeptide. In this context "about" includes the particularly recited 

20 ranges larger or smaller by several, 5, 4, 3, 2 or 1 amino acid at either extreme or at both extremes. 

Preferred fragments include, for example, truncation polypeptides having the amino acid 
sequence of IGS4 polypeptides, except for deletion of a continuous series of residues that includes the 
amino terminus, or a continuous series of residues that includes the carboxyl terminus or deletion of two 

25 continuous series of residues, one including the amino terminus and one including the carboxyl 
terminus. An example of a truncated polypeptide according to the present invention is the polypetide of 
SEQ ID NO: 10 and SEQ ID NO: 12. which is encoded by the polynucleotide of SEQ ID NO: 9 
respectively SEQ ID NO: 11. Also preferred are fragments characterized by structural or functional 
attributes such as fragments that comprise alpha-helix and alpha-helix forming regions, beta-sheet and 

30 beta-sheet-forming regions, turn and turn-forming regions, coil and coil-forming regions, hydrophilic 
regions, hydrophobic regions, alpha amphipathic .regions, beta amphlpathic regions, flexible regions, 
surface-forming regions, substrate binding region, and high antigenic Index regions. Other preferred 
fragments are biologically active fragments. Biologically active fragments are those that mediate 
receptor activity, including those with a similar activity or an Improved activity, or with a decreased 

35 undesirable activity. Also included are those that are antigenic or Immunogenic in an animal, especially 
in a human. 

Thus, the polypeptides of the invention Include polypeptides having an amino acid sequence at 
least 80% identical to that of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8 and/or the 
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polypeptide having the amino acid sequence encoded by the DNA insert contained in the deposit no. 
CBS1 02221 or the deposit no. CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn the 
Netherlands, or fragments thereof with at least 80% identity to the corresponding fragment. Preferably, 
all of these polypeptide fragments retain the biological activity of the receptor, including antigenic 
5 activity. Variants of the defined sequence and fragments also form part of the present invention. 
Preferred variants are those that vary from the referents by conservative amino acid substitutions — i.e., 
those that substitute a residue with another of like characteristics. Typical such substitutions are among 
Ala, Val, Leu and He; among Ser and Thr; among the acidic residues Asp and Glu; among Asn and Gin; 
and among the basic residues Lys and Arg; or aromatic residues Phe and Tyr. Particularly preferred are 
10 variants in which several, 5-10. 1-5. or 1-2 amino acids are substituted, deleted, or added in any 
combination. 

With regard to the polypeptides of the present invention it was also found that they show a high 
affinity binding for neuromedin U. in particular for neuromedin U-8 (an oligopeptide of 8 amino acids), 
15 neuromedin U-23 (an oligopeptide of 23 amino acids) and/or neuromedin U-25 (an oligopeptide of 25 
amino acids). In the context of the present invention the term "high affinity" is understood as to describe 
a ligand binding showing log EC50 values of at least below -6.00 (approx. 660 nM). preferably log ECgo 
below -7.00 (approx. 55 nM), more preferably log EC50 below -9.00 (approx. 500 pM to 1 .2 nM), and 
most preferably log EC50 below -10.00 (approx, 50-100 pM). 

20 

Two forms of the neuropeptide neuromedin U, neuromedin U-8 and neuromedin U-25, are 
described in the literature as uterus stimulating and hypertensive peptides (Minamino et al., 1985. 
Biochem. Biophys. Res. Commun. 130:1078-1085) being originally isolated from the porcine spinal cord. 
For neuromedin U-23, an oligopeptide of 23 amino acids, see for example: Okimura et al., Pept. Chem. 

25 (1995), Vol. Date 1994, 32:321-324; Salmon et al,. J. Biol. Chem. (2000). 275(7), 4549-4554, 
Neuromedin U (NMU) was subsequently isolated from a number of species, e.g. from rat (NMU-23), 
human (NMU-25). frog (NMU-25). dog (NMU-8 and NMU-25), rabbit (NMU-25). and chicken (NMU-25). 
Thus, Domin et al. described the characterization of neuromedin U like immunoreactivity in rat, porcine, 
guinea pig and human tissue extracts using a specific radioimmunoassay (1986. Biochem. Biophys. 

30 Res. Commun. 140:1127-34). The primary structure of neuromedin U 23 from the rat ileum was 
established by Conlon et al. (1988. J. Neurochem. 51:988-991). Minamino et al. (1988. Biochem. 
Biophys. Res. Commun. 156:355-360) have Isolated rat neuromedin U from the small intestine using 
mainly immunoafflnity chromatography and radioimmunoassay for pig neuromedin U-8, and the amino 
acid sequence of rat neuromedin U was determined by microsequence analysis and the structure was 

35 confirmed by synthesis. Although the C-terminal heptapeptlde amide structure of pig neuromedin U is 
completely conserved in rat neuromedin U, the remainder of the peptide reveals nine amino acid 
replacements and two amino acid deletions when compared to pig neuromedin U-25. The distribution, 
primary structure, and relative biological activity of neuromedin U has been determined also in the frog 
Rana temporaria by Domin et al. (1989. J. Biol. Chem. 264:20881-20885) showing that the entire 
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sequence was found to be an icosapentapeptide which displays marked sequence similarity to both 
porcine and rat neuromedin U. In a further study Domrn et al.(1992, ReguL Pept. 41:1-8) have purified 
an avian homolog of neuromedin U from the chicken, Microsequence analysis characterized the peptide 
to be 25 amino acid residues long, and chicken neuromedin U showed marked sequence similarity with 
5 the porcine peptide at its bioactive C-terminal region. Isolation, structural characterization and 
pharmacological activity of dog neuromedin U-25 was described by O'Harte et al. (1991 Peptides 12:11- 
15). Furthermore, for rabbit neuromedin U-25 it was found that it lacks conservation of a 
posttranslational processing site (Kage et al.,1991 Regul. Pept. 33:191-198); thus, in rabbit neuromedin 
U, the Arg16-Arg17 dibasic residue processing site that is found in pig and dog neuromedin U-25 is 
10 replaced by Arg-Gly, but this potential monobasic processing site is not utilized by cleavage enzyme{s) 
in the intestine. 

Among the species studied the 5 amino acids at the C-terminus of the peptide were found to be 
almost totally conserved, suggesting that this region is of major importance. Thus, mammalian 

15 neuromedins share a common C-terminal sequence "-Phe-Leu-Phe-Arg-Pro-Arg-Asn-amide" which 
appears to be essential for its biological activities. NMU is distributed both in the gastrointestinal tract 
and the central nervous System (CNS). In the rat, the highest concentration of neuromedin (NMU) was 
found in the ileum, followed by the pituitary, hypothalamus, spinal cord, thyroid, and the genitourinary 
tract. Immunohistochemistry studies showed that NMU immunoreactivity in the gut was only found in 

20 nerve fibers, mainly in the myenteric and submucous plexuses, and in the mucosa of all areas except 
stomach while no NMU immunoreactivity was found in endocrine cells. In the rat brain. NMU 
immunoreactivity was found In fibers widespread throughout the brain with the exception of the 
cerebellum. Human and rat genes encoding NMU precursor have been isolated. Both encode NMU at 
the C-terminus and other potential peptide products in the middle (Lo et al., 1992. J. Mol. Endocrinol. 

25 6:1538-1544; Austin et ai.. 1995. J. Mol. Endocrinol. 14:157-169). High affinity NMU binding was 
characterized in rat uterus, and was shown to be sensitive to GTP--S (Nandha et al., 1993, 
Endocrinology 133:482-486). suggesting that a receptor for NMU should be a G-protein coupled 
receptor. Nevertheless, the physiological role of NMU remains largely unknown. Neuromedin U can 
cause potent contraction of smooth muscle, increase arterial blood pressure, modify intestinal ion 

30 transport, and at low doses stimulates the function and growth of the adrenal cortex. NMU was also 
shown to reduce the blood flow in superior enteric artery and portal vein while increase blood flow 
slightly in pancreatic tissue. 

Furthermore, according to the international patent application WO 90/01330 the neuromedins U-8 
and U-25 are described to be suitable in the treatment of disorders of the gastrointestinal tract, e.g. 

35 being useful in the selective reduction of blood flow to the gastrointestinal tract, in the treatment of 
gastrointestinal bleeding and postprandial hypotension. 

The IGS4 polypeptides of the present invention have been identified as a G-protein coupled 
receptor responsive to neuromedin U or ligands sufficiently similar thereto. Thus the IGS4 receptor, in 
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particular the IGS4B receptor, responsive to neuromedin U will greatly facilitate the understanding of the 
physiological mechanisms of neuromedin U and other ligands sufficiently similar thereto, as well as of 
related diseases. 

5 The tissue distribution of the polypeptides of the present invention and the expression levels are shown 
In the Figures 5-8, from which the skilled artisan can estimate the localisation and relevance of 
expression. For instance, with regard to the tissue distribution of the polypeptides of the present 
invention it was found, e.g. by MTE (multiple tissue expression) analysis, Northern blot analysis and 
Quantitative RT-PCR expression analysis that the IGS4 polypeptides of the present invention 

10 particularly are brought to expression with a medium level (relative to expression in testis as 100% in 
MTE blot, or in spinal cord as 100% in Quantitative RT-PCR analysis, respectively) e.g. in brain, skeletal 
muscle, cerebellum, thymus, medulla, thyroid, trachea, thalamus, substantia nigra, corpus callosum. 
caudate nucleus, pons, nucleus accumbens, fetal brain and stomach; and with a relevant level (if being 
detectable by Quantitative RT-PCR analysis) e.g. in heart, lung, and prostate. For instance, expression 

1 5 levels are considered to be medium if they amount at least 20% of the expression value found for the by 
far highest expression (set as 100%) in testis or spinal cord. For instance, expression levels are 
considered to be relevant if expression could be detected at least via Quantitative RT-PCR analysis. It 
will be appreciated that expression levels indicated for any organ are average values of expression 
levels in the specific tissues and cell types constituting the organ. Thus, if an expression level is just 

20 found to be relevant with respect to an organ, this does not necessarily exclude medium or even high 
expression levels locally within a specific region, e.g. In a specific tissue and/or cell type, of the organ. 

These results suggest that IGS4 polypeptides preferably play a role in the nervous system, 
including the central nervous system (CNS) and the peripheral nervous system (PNS). in the 
25 gastrointestinal system and/or in the cardiovascular system and/or in skeletal muscle and/or in the 
thyroid, and/or also in lung diseases, immunological diseases and disorders of the genitourinary system. 

Thus, in a further embodiment the invention pertains also to an isolated IGS4 polypeptide 
comprising an amino acid sequence of a neuromedin receptor protein, preferably of a mammalian 

30 neuromedin receptor protein, said protein exhibiting high affinity binding for neuromedin U, preferably for 
neuromedin U-8, for neuromedin U-23 and/or for neuromedin U-25. Particularly, the isolated IGS4 
polypeptide comprising an amino acid sequence of a neuromedin receptor protein, is a protein exhibiting 
expression (being at least detectable via Northern and/or MTE and/or Quantitative RT-PCR analysis) in 
brain, skeletal muscle, cerebellum, testis, corpus callosum. spinal cord, substantia nigra, medulla, 

35 thalamus, caudate nucleus, pons, nucleus accumbens, fetal brain, stomach, heart, thyroid gland, lung, 
thymus, prostate and/or In trachea. In a variant of this embodiment the invention pertains to an isolated 
IGS4 polypeptide comprising an amino acid sequence of a neuromedin receptor protein, preferably of a 
mammalian neuromedin receptor protein, said protein exhibiting high affinity binding for neuromedin U, 
preferably for neuromedin U-8, for neuromedin U-23 and/or for neuromedin U-25, said protein exhibiting 
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expression (being at least detectable via Northern and/or MTE and/or Quantitative RT-PCR analysis) in 
brain, skeletal muscle, cerebellum, testis, corpus callosum. spinal cord, substantia nigra, medulla, 
thalamus, caudate nucleus, pons, nucleus accumbens, fetal brain, stomach, heart, thyroid gland, lung, 
thymus, prostate and/or in trachea, and said amino acid sequence being selected from the group of 
5 amino acid sequence as already defined supra. 

The IGS4 polypeptides of the invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced polypeptides, 
synthetically produced polypeptides, or polypeptides produced by a combination of these methods. 
1 0 Methods for preparing such polypeptides are well known in the art. 

Polynucleotides of the Invention 

A further aspect of the invention relates to IGS4 polynucleotides. IGS4 polynucleotides include 
15 isolated polynucleotides which encode the IGS4 polypeptides (including IGS4A and IGS4B) and 
fragments, and polynucleotides closely related thereto. More specifically, the IGS4 polynucleotide of the 
invention includes a polynucleotide comprising the nucleotide sequence contained in SEQ ID NO: 1, 
SEQ ID NO: 3. SEQ ID NO: 5 or SEQ ID NO: 7 encoding a IGS4A polypeptide of SEQ ID NO: 2 or of 
SEQ ID NO: 4 and a IGS4B polypeptide of SEQ ID NO: 6 or of SEQ ID NO: 8 respectively. 
20 polynucleotides having the particular sequence of SEQ ID NO: 1. SEQ ID NO: 3, SEQ ID NO: 5 or SEQ 
ID NO: 7 and polynucleotides which essentially correspond to the DNA insert contained in the deposit 
no. CBS1 02221 or the deposit no. CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn 
the Netherlands. 

25 IGS4 polynucleotides further include polynucleotides comprising a nucleotide sequence that has 

at least 80% identity over its entire length to a nucleotide sequence encoding the IGS4 polypeptide of 
SEQ ID NO: 2. SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8. a polynucleotide comprising a 
nucleotide sequence that is at least 80% identical to that of SEQ ID NO: 1. SEQ ID NO: 3, SEQ ID NO: 
5 or SEQ ID NO: 7 over its entire length and a polynucleotide which essentially correspond to the DNA 

30 insert contained in the deposit no. CBS1 02221 or the deposit no. CBS1 02222 at the Centraalbureau 
voor Schimmelcultures at Baarn the Netherlands. 

In this regard, polynucleotides with at least 90% identity are particulariy preferred, and those with 
at least 95% are especially preferred. Furthermore, those with at least 97% are highly preferred and 
those with at least 98-99% are most highly preferred, with at least 99% being the most preferred. Also 

35 included under IGS4 polynucleotides are a nucleotide sequence which has sufficient identity to a 
nucleotide sequence contained in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 or to 
the DNA insert contained in the deposit no. CBS1 02221 or in the deposit no. CBS1 02222 at the 
Centraalbureau voor Schimmelcultures at Baarn the Netherlands to hybridize under conditions useable 
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for amplification or for use as a probe or marker. The invention also provides polynucleotides which are 
complementary to such IGS4 polynucleotides. 

IGS4 of the Invention is structurally related to other proteins of the G-protein coupled receptor 
family, as shown by the results of BLAST searches in the public databases. The amino acid sequence of 
Table 1 (SEQ ID NO: 2) has about 46 % identity (using BLAST. Altschul S.F. et al. [1997], Nucleic Acids 
Res. 25:3389-3402) over most of its length (316 amino acid residues) with a human orphan G-protein 
coupled receptor (Accession # 043664. Tan et a!.. Genomics 52(2):223-229 (1998). There is 27 % 
homology (over amino acid residues 61-349 ) to the rat neurotensin 1 receptor (Accession # P20789 
Tanaka K.et al. Neuron 4:847-854 (1990)). The nucleotide sequence of Table 1 (SEQ ID NO: 1) is 63 % 
identical to an orphan G-protein coupled receptor over nucleotide residues 120-864 (Accession # 
AF044600. corresponding with the protein sequence 043664). Furthermore, there is 53 % identity to the 
human growth hormone secretagogue receptor over residues 94-1137 (Howard A.D.et al. Science 
273:974-977(1996)). Thus, IGS4 polypeptides and polynucleotides of the present invention are 
expected to have, inter alia, similar biological functions/properties to their homologous polypeptides and 
polynucleotides, and their utility is obvious to anyone skilled in the art. 

Polynucleotides of the invention can be obtained from natural sources such as genomic DNA. In 
particular, degenerated PGR primers can be designed that encode conserved regions within a particular 
GPCR gene subfamily. PGR amplification reactions on genomic DNA or cDNA using the degenerate 
primers will result in the amplification of several members (both known and novel) of the gene family 
under consideration (the degenerated primers must be located within the same exon, when a genomic 
template is used). (Libert et al., Science, 1989. 244: 569-572). Polynucleotides of the invention can also 
be synthesized using well-known and commercially available techniques (e.g. F.M. Ausubel et al, 2000. 
Current Protocols in Molecular Biology). 

The nucleotide sequence encoding the IGS4 polypeptide of SEQ ID NO: 2, SEQ ID NO: 4. SEQ 
ID NO: 6 or SEQ ID NO: 8 may be identical to the polypeptide encoding sequence contained in SEQ ID 
NO: 1 (nucleotide number 55 to 1299) or SEQ ID NO: 3 (nucleotide number 64 to 1299). or SEQ ID NOi 
5 (nucleotide number 55 to 1299) or SEQ ID NO: 7 (nucleotide number 64 to 1299) respectively, or it 
may be a different nucleotide sequence, which as a result of the redundancy (degeneracy) of the 
genetic code might also show alterations compared to the polypeptide encoding sequence contained in 
SEQ ID NO: 1. SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7, but also encodes the polypeptide of 
SEQ ID NO: 2. SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8, respectively. 

In a further embodiment the invention pertains to an isolated nucleotide sequence encoding an 
IGS4 neuromedin receptor protein, preferably encoding a mammalian neuromedin receptor protein, said 
protein exhibiting high affinity binding for neuromedin U, preferably for neuromedin U-8, for neuromedin 
U-23 and/or for neuromedin U-25. Particularly, the isolated nucleotide sequence encodes an IGS4 
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neuromedin receptor protein which is a protein exhibiting expression (being at least detectable via 
Northern and/or MTE and/or Quantitative RT-PCR analysis) in brain, skeletal muscle, cerebellum, testis, 
corpus callosum. spinal cord, substantia nigra, medulla, thalamus, caudate nucleus, pons, nucleus 
accumbens. fetal brain, stomach, heart, thyroid gland, lung, thymus, prostate and/or In trachea. In a 
5 variant of this embodiment the invention pertains to an isolated nucleotide sequence encoding an IGS4 
neuromedin receptor protein, preferably encoding a mammalian neuromedin receptor protein, said 
protein exhibiting high affinity binding for neuromedin U, preferably for neuromedin U-8. for neuromedin 
U-23 and/or for neuromedin U-25, said protein exhibiting expression (being at least detectable via 
Northern and/or MTE and/or Quantitative RT-PCR analysis) in brain, skeletal muscle, cerebellum, testis, 
10 corpus callosum, spinal cord, substantia nigra, medulla, thalamus, caudate nucleus, pons, nucleus 
accumbens, fetal brain, stomach, heart, thyroid gland, lung, thymus, prostate and/or in trachea, and said 
nucleotide sequence being selected from the group of nucleotide sequences as already defined supra. 

When the polynucleotides of the invention are used for the recombinant production of the IGS4 
15 polypeptide, the polynucleotide may include the coding sequence for the mature polypeptide or a 
fragment thereof, by itself; the coding sequence for the mature polypeptide or fragment in reading frame 
with other coding sequences, such as those encoding a leader or secretory sequence, a pre-, or pro- or 
prepro- protein sequence, or other fusion peptide portions. For example, a marker sequence which 
facilitates purification of the fused polypeptide can be encoded. In certain preferred embodiments of this 
20 aspect of the invention, the marker sequence is a hexa-histidine peptide, as provided in the pQE vector 
(Qiagen, Inc.) and described in Gentz et aL, Proc. Natl. Acad. Sci USA (1989) 86:821-824, or is an HA 
tag. The polynucleotide may also contain non-coding 5* and 3' sequences, such as transcribed, non- 
translated sequences, splicing and polyadenylation signals, ribosome binding sites and sequences that 
stabilize mRNA. 

25 

Further preferred embodiments are polynucleotides encoding IGS4 variants comprising the amino 
acid sequence of the IGS4 polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 
8 in which several, 5-10, 1-5. 1-3. 1-2 or 1 amino acid residues are substituted, deleted or added, in any 
combination. 

30 

The polynucleotides of the invention can be engineered using methods generally known in the 
art in order to alter IGS4-encoding sequences for a variety of purposes including, but not limited to, 
modification of the cloning, processing, and/or expression of the gene product. DNA shuffling by random 
fragmentation and PGR reassembly of gene fragments and synthetic oligonucleotides may be used to 
35 engineer the nucleotide sequences. For example, oligonucleotide-mediated site-directed mutagenesis 
may be used to introduce mutations that create amino acid substitutions, create new restriction sites, 
alter modification (e.g. glycosyiation or phosphorylation) patterns, change codon preference, produce 
splice variants, and so forth. 
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The present invention further relates to polynucleotides that hybridize to the herein above- 
described sequences. In this regard, the present invention especially relates to polynucleotides which 
hybridize under stringent conditions to the polynucleotides described above. As herein used, the term 
•"stringent conditions" means hybridization will occur only if there is at least 80%, and preferably at least 
5 90%, and more preferably at least 95%. yet even more preferably at least 97%. in particular at least 
99% identity between the sequences. 

Polynucleotides of the invention, which are identical or sufficiently identical to a nucleotide 
sequence contained in SEQ ID NO: 1 , SEQ ID NO: 3. SEQ ID NO: 5 or SEQ ID NO: 7 or a fragment 

10 thereof, may be used as hybridization probes for cDNA and genomic DNA. to isolate fulf-length cDNAs 
and genomic clones encoding IGS4 and to isolate cDNA and genomic clones of other genes (including 
genes encoding homologs and orthologs from species other than human) that have a high sequence 
similarity to the IGS4 gene. People skilled in the art are well aware of such hybridization techniques. 
Typically these nucleotide sequences are 80% identical, preferably 90% identical, more preferably 95% 

15 identical to that of the referent. The probes generally will comprise at least 5 nucleotides, and preferably 
at least 8 nucleotides, and more preferably at least 10 nucleotides, yet even more preferably at least 12 
nucleotides, in particular at least 15 nucleotides. Most preferred, such probes will have at least 30 
nucleotides and may have at least 50 nucleotides. Particulariy preferred probes will range between 30 
and 50 nucleotides. 

20 

One embodiment, to obtain a polynucleotide encoding the IGS4 polypeptide, including homologs 
and orthologs firom species other than human, comprises the steps of screening an appropriate library 
under stringent hybridization conditions with a labeled probe having the SEQ ID NO: 1. SEQ ID NO: 3. 
SEQ ID NO: 5 or SEQ ID NO: 7 or a fragment thereof, and isolating full-length cDNA and genomic 

25 clones containing said polynucleotide sequence. Such hybridization techniques are well known to those 
of skill in the art. Stringent hybridization conditions are as defined above or alternatively conditions under 
overnight incubation at 42 in a solution comprising: 50% fomnamide. 5xSSC (150mM NaCI. 15mM 
trisodium citrate). 50 mM sodium phosphate (pH7.6). 5x Denhardt's solution, 10 % dextran sulfate (w/v), 
and 20 microgram/ml denatured, sheared salmon sperm DNA. followed by washing the filters in 

30 O.lxSSC at about 65°C. 

The polynucleotides and polypeptides of the present invention may be used as research reagents 
and materials for discovery of treatments and diagnostics to animal and human disease. 

35 Vectors, Host Cells, Expression 

The present invention also relates to vectors which comprise a polynucleotide or polynucleotides 
of the present invention, and host cells which are genetically engineered with vectors of the invention 
and to the production of polypeptides of the invention by recombinant techniques. Cell-free translation 
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systems can also be used to produce such proteins using RNAs derived from the DNA constructs of the 
present invention. 

For recombinant production, host cells can be genetically engineered to incorporate expression 
5 systems or portions thereof for polynucleotides of the present invention. Introduction of polynucleotides 
into host cells can be effected by methods described in many standard laboratory manuals, such as 
Davis et al.. BASIC METHODS IN MOLECULAR BIOLOGY (1986) and Sambrook et al., MOLECULAR 
CLONING: A LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor Laboratory Press. Cold Spring 
Harbor, N.Y. (1989) such as calcium phosphate transfection, DEAE-dextran mediated transfection, 
10 transvection, microinjection, cationic llpid-mediated transfection, electroporation, transduction, scrape 
loading, ballistic introduction or infection. 

Representative examples of appropriate hosts include bacterial cells, such as streptococci, 
staphylococci, E. coli. Streptomyces and Bacillus subtilis cells; fungal cells, such as yeast ceils and 
15 Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as 
CHO, COS. HeLa, C127. 3T3, BHK. HEK 293 and Bowes melanoma cells; and plant cells. 

A great variety of expression systems can be used. Such systems include, among others, 
chromosomal, episomal and virus-derived systems, e.g.. vectors derived from bacterial plasmids, from 

20 bacteriophage, from transposons, from yeast episomes. from insertion elements, from yeast 
chromosomal elements, from viruses such as baculoviruses. papova viruses, such as SV4D, vaccinia 
viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived 
from combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, 
such as cosmids and phagemids. The expression systems may contain control regions that regulate as 

25 well as engender expression. Generally, any system or vector suitable to maintain, propagate or 
express polynucleotides to produce a polypeptide in a host may be used. The appropriate nucleotide 
sequence may be inserted into an expression system by any of a variety of well-known and routine 
techniques, such as, for example, those set forth in Sambrook et al.. MOLECULAR CLONING. A 
LABORATORY MANUAL (supra). 

30 

For secretion of the translated protein into the lumen of the endoplasmic reticulum, into the 
periplasmic space or into the extracellular environment, appropriate secretion signals may be 
incorporated into the desired polypeptide. These signals may be endogenous to the polypeptide or they 
may be heterologous signals, i.e. derived from a different species. 

35 

If the IGS4 polypeptide is to be expressed for use in screening assays, generally, it is preferred 
that the polypeptide be produced at the surface of the cell. In this event, the cells may be harvested prior 
to use in the screening assay. In case the affinity or functional activity of the IGS4 polypeptide is 
modified by receptor activity modifying proteins (RAMP), coexpression of the relevant RAMP most likely 
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at the surface of the cell is preferred and often required. Also in this event harvesting of cells expressing 
the IGS4 polypeptide and the relevant RAMP prior to use in screening assays is required. If the IGS4 
polypeptide is secreted into the medium, the medium can be recovered in order to recover and purify the 
polypeptide; if produced intracellularly, the cells must first be lysed before the polypeptide is recovered. 
5 Membranes expressing the IGS4 polypeptide can be recovered by methods that are well known to a 
person skilled in the art. In general, such methods Include harvesting of the cells expressing the IGS4 
polypeptide and homogenization of the cells by a method such as, but not limited to, pottering. The 
membranes may be recovered by washing the suspension one or several times. 

10 IGS4 polypeptides can be recovered and purified from recombinant cell cultures by well-known 

methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange 
chromatography, phosphoceliulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most preferably, high 
performance liquid chromatography is employed for purification. Well-known techniques for refolding 

1 5 proteins may be employed to regenerate active conformation when the polypeptide is denatured during 
isolation and or purification. 

Diagnostic Assays 

20 This invention also relates to the use of IGS4 polynucleotides for use as diagnostic reagents. Detection 
of a mutated form of the IGS4 gene associated with a dysfunction will provide a diagnostic tool that can 
add to or define a diagnosis of a disease or susceptibility to a disease which results from under- 
expression, over-expression or altered expression of IGS4. Also in this event co-expression of relevant 
receptor activity modifying proteins can be required to obtain diagnostic assays of desired quality. 

25 Individuals carrying mutations in the IGS4 gene may be detected at the DNA level by a variety of 
techniques. 

Nucleic acids for diagnosis may be obtained from a subject's cells, such as from blood, urine, 
saliva, tissue biopsy or autopsy material. The genomic DNA may be used directly for detection or may 

30 be amplified enzymatically by using PGR or other amplification techniques prior to analysis. RNA or 
cDNA may also be used in similar fashion. Deletions and insertions can be detected by a change in size 
of the amplified product in comparison to the normal genotype. Point mutations can be identified by 
hybridizing amplified DNA to labeled IGS4 nucleotide sequences. Perfectly matched sequences can be 
distinguished from mismatched duplexes by RNase digestion or by differences in melting temperatures. 

35 DNA sequence differences may also be detected by alterations in electrophoretic mobility of DNA 
fragments in gels, with or without denaturing agents, or by direct DNA sequencing. See, e.g., Myers et 
al.. Science (1985) 230:1242. Sequence changes at specific locations may also be revealed by 
nuclease protection assays, such as RNase and S1 protection or the chemical cleavage method. See 
Cotton et al., Proc. Natl. Acad. Sci. USA (1985) 85: 4397-4401. In another embodiment, an array of 
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oligonucleotide probes comprising the IGS4 nucleotide sequence or fragments thereof can be 
constructed to conduct efficient screening of e.g., genetic mutations. Array technology methods are well 
known and have general applicability and can be used to address a variety of questions in molecular 
genetics including gene expression, genetic linkage, and genetic variability. (See for example: M.Chee 
5 et al.. Science. Vol 274, pp 610-613 (1996)). 

The diagnostic assays offer a process for diagnosing or determining a susceptibility to PNS, 
psychiatric and CNS disorders, including schizophrenia, episodic paroxysmal anxiety (EPA) disorders 
such as obsessive compulsive disorder (OCD). post traumatic stress disorder (PTSD), phobia and 

10 panic, major depressive disorder, bipolar disorder, Parkinson's disease, general anxiety disorder, 
autism, delirium, multiple sclerosis, Alzheimer disease/dementia and other neurodegenerative diseases, 
severe mental retardation, dyskinesias, Huntington's disease, Touretfs syndrome, tics, tremor, dystonia, 
spasms, anorexia, bulimia, stroke, addiction/dependency/craving, sleep disorder, epilepsy, migraine; 
attention deficit/hyperactivity disorder (ADHD); cardiovascular diseases, including heart failure, angina 

15 pectoris, arrhythmias, myocardial infarction, cardiac hypertrophy, hypotension, hypertension - e.g. 
essential hypertension, renal hypertension, or pulmonary hypertension, thrombosis, arteriosclerosis, 
cerebral vasospasm, subarachnoid hemorrhage, cerebral ischemia, cerebral infarction, peripheral 
vascular disease, Raynaud's disease, kidney disease - e.g. renal failure; dyslipidemias; obesity: 
emesis; gastrointestinal disorders, including irritable bowel syndrome (IBS), inflammatory bowel disease 

20 (IBD), gastroesophagal reflux disease (GERD), motility disorders and conditions of delayed gastric 
emptying, such as post operative or diabetic gastroparesis. and diabetes, ulcers - e.g. gastric ulcer; 
diarrhoea; other diseases including osteoporosis; inflammations; infections such as bacterial, fungal, 
protozoan and viral infections, particularly infections caused by HIV-1 or HlV-2; pain; cancers; 
chemotherapy induced injury; tumor invasion; immune disorders; urinary retention; asthma; allergies; 

25 arthritis; benign prostatic hypertrophy; endotoxin shock; sepsis; complication of diabetes mellitus; and 
gynaecological disorders, through detection of mutation in the IGS4 gene by the methods described. 
According to the present invention, the diagnostic assays offer in particular a process for diagnosing or 
determining a susceptibility to disorders of the nervous system, including the central nervous system 
(CNS) and the peripheral nervous system (PNS), disorders of the gastrointestinal system and/or of the 

30 cardiovascular system and/or of skeletal muscle and/or of the thyroid, and/or also to lung diseases, 
immunological diseases and disorders of the genitourinary system. 

In addition, PNS, psychiatric and CNS disorders, including schizophrenia, episodic paroxysmal 
anxiety (EPA) disorders such as obsessive compulsive disorder (OCD). post traumatic stress disorder 
35 (PTSD). phobia and panic, major depressive disorder, bipolar disorder. Parkinson's disease, general 
anxiety disorder, autism, delirium, multiple sclerosis. Alzheimer disease/dementia and other 
neurodegenerative diseases, severe mental retardation, dyskinesias, Huntington's disease, Touretfs 
syndrome, tics, tremor, dystonia, spasms, anorexia, bulimia, stroke, addiction/dependency/craving, 
sleep disorder, epilepsy, migraine; attention deficit/hyperactivity disorder (ADHD); cardiovascular 
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diseases. including heart failure, angina pectoris, arrhythnnias. myocardial infarction, cardiac 
hypertrophy, hypotension, hypertension - e.g. essential hypertension, renal hypertension, or pulmonary 
hypertension, thrombosis, arteriosclerosis, cerebral vasospasm, subarachnoid hemorrhage, cerebral 
ischemia, cerebral infarction, peripheral vascular disease. Raynaud's disease, kidney disease - e.g. 
5 renal failure; dysllpidemias; obesity; emesis; gastrointestinal disorders, including irritable bowel 
syndrome (IBS), Inflammatory bowel disease (IBD), gastroesophagal reflux disease (GERD). motility 
disorders and conditions of delayed gastric emptying, such as post operative or diabetic gastroparesis, 
and diabetes, ulcers - e.g. gastric ulcer; diarrhoea; other diseases including osteoporosis; 
inflammations; infections such as bacterial, fungal, protozoan and viral infections, particularly infections 

10 caused by HIV-1 or HIV-2; pain; cancers; chemotherapy Induced injury; tumor invasion; immune 
disorders; urinary retention; asthma; allergies; arthritis; benign prostatic hypertrophy; endotoxin shock; 
sepsis; complication of diabetes mellltus; and gynaecological disorders, can be diagnosed by methods 
comprising determining from a sample derived from a subject an abnormally decreased or increased 
level of the IGS4 polypeptide or IGS4 mRNA. In particular disorders of the nervous system. Including the 

15 central nervous system (CNS) and the peripheral nervous system (PNS). disorders of the 
gastrointestinal system and/or of the cardiovascular system and/or of skeletal muscle and/or of the 
thyroid, and/or also lung diseases, immunological diseases and disorders of the genitourinary system 
can be diagnosed by methods comprising determining from a sample derived from a subject an 
abnomrially decreased or increased level of the IGS4 polypeptide or IGS4 mRNA. Decreased or 

20 increased expression can be measured at the RNA level using any of the methods well known in the art 
for the quantitation of polynucleotides, such as, for example. PGR. RT-PGR, RNase protection. Northern 
blotting and other hybridization methods. Assay techniques that can be used to determine levels of a 
protein, such as an IGS4, in a sample derived from a host are well known to those of skill in the art. 
Such assay methods Include radioimmunoassays, competitive-binding assays. Western Blot analysis 

25 and ELISA assays. 

In another aspect, the present invention relates to a diagonostic kit for a disease or suspectibility 
to a disease, particulariy PNS. psychiatric and CNS disorders, including schizophrenia, episodic 
paroxysmal anxiety (EPA) disorders such as obsessive compulsive disorder (OCD), post traumatic 

30 stress disorder (PTSD). phobia and panic, major depressive disorder, bipolar disorder, Parkinson's 
disease, general anxiety disorder, autism, delirium, multiple sclerosis, /Mzhetmer disease/dementia and 
other neurodegenerative diseases, severe mental retardation, dyskinesias, Huntington's disease. 
Tourett's syndrome, tics, tremor, dystonia. spasms. anorexia, bulimia, stroke, 
addiction/dependency/craving, sleep disorder, epilepsy, migraine; attention deficit/hyperactivity disorder 

35 (ADHD); cardiovascular diseases, including heart failure, angina pectoris, arrhythmias, myocardial 
infarction, cardiac hypertrophy, hypotension, hypertension - e.g. essential hypertension, renal 
hypertension, or pulmonary hypertension, thrombosis, arteriosclerosis, cerebral vasospasm, 
subarachnoid hemorrhage, cerebral ischemia, cerebral in^rction, peripheral vascular disease, 
Raynaud's disease, kidney disease - e.g. renal failure; dysllpidemias; obesity; emesis; gastrointestinal 
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disorders. including irritable bowel syndrome (IBS), inflammatory bowel disease (IBD). gastroesophaga! 
reflux disease (GERD), motility disorders and conditions of delayed gastric emptying, such as post 
operative or diabetic gastroparesis, and diabetes, ulcers - e.g. gastric ulcer; diarrhoea; other diseases 
Including osteoporosis; inflammations; infections such as bacterial, fungal, protozoan and viral 
infections, particularly Infections caused by HIV-1 or HIV-2; pain; cancers; chemotherapy induced injury; 
tumor invasion; immune disorders; urinary retention; asthma; allergies; arthritis; benign prostatic 
hypertrophy; endotoxin shock; sepsis; complication of diabetes mellltus; and gynaecological disorders, 
which comprises: 

(a) an IGS4 polynucleotide, preferably the nucleotide sequence of SEQ ID NO: 1 . SEQ ID NO: 3, 

SEQ ID NO: 5 or SEQ ID NO: 7, or a fragment thereof; and/or 

(b) a nucleotide sequence complementary to that of (a); and/or 

(c) an IGS4 polypeptide, preferably the polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 6 or of SEQ ID NO: 8. or a fragment thereof; and/or 

(d) an antibody to an IGS4 polypeptide, preferably to the polypeptide of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 6 or of SEQ ID NO: 8; and/or 

(e) a RAMP polypeptide required for the relevant biological or antigenic properties of an IGS4 
polypeptide 

It will be appreciated that in any such kit, (a), (b). (c) (d) or (e) may comprise a substantial component. 
Preferably the present invention relates to a diagnostic kit for diagnosing or determining a disease or a 
susceptibility to a disease of the nervous system, including the central nervous system (CNS) and the 
peripheral nervous system (PNS). a disease of the gastrointestinal system and/or of the cardiovascular 
system and/or of skeletal muscle and/or of the thyroid, and/or also lung diseases, immunological 
diseases and disorders of the genitourinary system. 

Chromosome Assays 

The nucleotide sequences of the present invention are also valuable for chromosome 
identification. The sequence is specifically targeted to and can hybridize with a particular location on an 
individual human chromosome. The mapping of relevant sequences to chromosomes according to the 
present invention Is an important first step in correlating those sequences with gene associated disease. 
Once a sequence has been mapped to a precise chromosomal location, the physical position of the 
sequence on the chromosome can be correlated with genetic map data. Such data are found, for 
example, in V. McKusick, Mendelian Inheritance in Man (available on line through Johns Hopkins 
University Welch Medical Library). The relationship between genes and diseases that have been 
mapped to the same chromosomal region are then identified through linkage analysis (coinheritance of 
physically adjacent genes). 
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The differences in the cDNA or genomic sequence between affected and unaffected individuals 
can also be detemnined. If a mutation is observed in some or all of the affected individuals but not in any 
normal individuals, then the mutation is likely to be the causative agent of the disease. 
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The polypeptides of the invention or their fragments or analogs thereof, or cells expressing them If 
required together with relevant RAMP's. may also be used as immunogens to produce antibodies 
immunospecific for the IGS4 polypeptides. The term "immunospecific" means that the antibodies have 
substantiall greater affinity for the polypeptides of the invention than their affinity for other related 
polypeptides in the prior art. 

Antibodies generated against the IGS4 polypeptides may be obtained by administering the 
polypeptides or epitope-bearing fragments, analogs or cells to an animal, preferably a nonhuman, using 
routine protocols. For preparation of monoclonal antibodies, any technique, which provides antibodies 
produced by continuous cell line cultures, may be used. Examples include the hybridoma technique 
(Kohler, G. and Milstein, C, Naure (1975) 256:495-497), the trioma technique, the human B-cell 
hybridoma technique (Kozbor et al., Immunology Today (1983) 4:72) and the EBV-hybridoma technique 
(Cole et al.. MONOCLONAL ANTIBODIES AND CANCER THERAPY, pp. 77-96. Alan R. Uss, Inc., 
1985). 

The above-described antibodies may be employed to isolate or to identify clones expressing the 
polypeptide or to purify the polypeptides by affinity chromatography. 

Antibodies against IGS4 polypeptides as suchTor against IGS4 polypeptide-RAMP complexes, may also 
be employed to treat PNS, psychiatric and CNS disorders, including schizophrenia, episodic paroxysmal 
anxiety (EPA) disorders such as obsessive compulsive disorder (OCD), post traumatic stress disorder 
(PTSD), phobia and panic, major depressive disorder, bipolar disorder, Parkinson's disease, general 
anxiety disorder, autism, delirium, multiple sclerosis, Alzheimer disease/dementia and other 
neurodegenerative diseases, severe mental retardation, dyskinesias, Huntington's disease, Tourett's 
syndrome, tics, tremor, dystonia, spasms, anorexia, bulimia, stroke, addiction/dependency/craving, 
sleep disorder, epilepsy, migraine; attention deficit/hyperactivity disorder (ADHD); cardiovascular 
diseases, including heart failure, angina pectoris, arrhythmias, myocardial infarction, cardiac 
hypertrophy, hypotension, hypertension - e.g. essentia! hypertension, renal hypertension, or pulmonary 
hypertension, thrombosis, arteriosclerosis, cerebral vasospasm, .subarachnoid hemorrhage, cerebral 
ischemia, cerebral infarction, peripheral vascular disease. Raynaud's disease, kidney disease - e.g. 
renal failure; dyslipidemias; obesity; emesis; gastrointestinal disorders, including irritable bowel 
syndrome (IBS), inflammatory bowel disease (IBD), gastroesophagal reflux disease (GERD), motility 
disorders and conditions of delayed gastric emptying, such as post operative or diabetic gastroparesis, 
and diabetes, ulcers - e.g. gastric ulcer; diarrhoea; other diseases including osteoporosis; 
inflammations: infections such as bacterial, fungal, protozoan and viral infections, particularly infections 
caused by HIV-1 or HIV-2; pain; cancers; chemotherapy induced injury; tumor invasion; immune 
disorders; urinary retention; asthma; allergies; arthritis; benign prostatic hypertrophy; endotoxin shock; 
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sepsis; complication of diabetes mellitus; and gynaecological disorders, among others. Preferably the 
antibodies of the present invention may be used to treat disorders of the nervous system, including the 
central nervous system (CNS) and the peripheral nervous system (PNS), disorders of the 
gastrointestinal system and/or of the cardiovascular system and/or of skeletal muscle and/or of the 
thyroid, and/or also to treat lung diseases, immunological diseases and disorders of the genitourinary 
system. 

Animals 



1 0 Another aspect of the invention relates to non-human animal-based systems which act as models 

for disorders arising from aberrant expression or activity of IGS4. Non-human animal-based model 
systems may also be used to further characterize the activity of the IGS4 gene. Such systems may be 
utilized as part of screening strategies designed to identify compounds \Nh\dh are capable to treat IGS4 
based disorders such as PNS, psychiatric and CNS disorders, including schizophrenia, episodic 

15 paroxysmal anxiety (EPA) disorders such as obsessive compulsive disorder (OGD), post traumatic 
stress disorder (PTSD), phobia and panic, major depressive disorder, bipolar disorder. Parkinson's 
disease, general anxiety disorder, autism, delirium, multiple sclerosis, Alzheimer disease/dementia and 
other neurodegenerative diseases, severe mental retardation, dyskinesias, Huntington's disease. 
Tourett's syndrome, tics, tremor, dystonia, spasms, anorexia, bulimia, stroke, 

20 addiction/dependency/craving, sleep disorder, epilepsy, migraine; attention deficit/hyperactivity disorder 
(ADHD); cardiovascular diseases, including heart failure, angina pectoris, arrhythmias, myocardial 
Infarction, cardiac hypertrophy, hypotension, hypertension - e.g. essential hypertension, renal 
hypertension, or pulmonary hypertension, thrombosis, arteriosclerosis, cerebral vasospasm, 
subarachnoid hemorrhage, cerebral ischemia, cerebral infarction, peripheral vascular disease, 

25 Raynaud's disease, kidney disease - e.g. renal failure; dyslipidemias; obesity; emesis; gastrointestinal 
disorders, including irritable bowel syndrome (IBS), inflammatory bowel disease (IBD), gastroesophagal 
reflux disease (GERD), motility disorders and conditions of delayed gastric emptying, such as post 
operative or diabetic gastroparesis, and diabetes, ulcers - e.g. gastric ulcer; diarrhoea; other diseases 
including osteoporosis; inflammations; infections such as bacterial, fungal, protozoan and viral 

30 infections, particularly infections caused by HIV-1 or HIV-2; pain; cancers; chemotherapy induced injury; 
tumor invasion; immune disorders; urinary retention; asthma; allergies; arthritis; benign prostatic 
hypertrophy; endotoxin shock; sepsis; complication of diabetes mellitus; and gynaecological disorders. 
In particular, the systems may be utilized as part of screening strategies designed to identify compounds 
which are capable in particular to treat IGS4 based disorders of the nervous system, including the 

35 central nervous system (CNS) and the peripheral nervous system (PNS), disorders of the 
gastrointestinal system and/or of the cardiovascular system and/or of skeletal muscle and/or of the 
thyroid, and/or also to treat lung diseases, immunological diseases and disorders of the genitourinary 
system. In this way the animal-based models may be used to identify pharmaceutical compounds, 
therapies and interventions which may be effective in treating disorders of aberrant expression or 
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activity of IGS4. In addition such animal models may be used to determine the LD50 and the ED50 In 
animal subjects. These data may be used to determine the in vivo efficacy of potential IGS4 disorder 
treatments. 

Animal-based model systems of IGS4 based disorders, based on aberrant IGS4 expression or 
5 activity, may include both non-recombinant animals as well as recombinantly engineered transgenic 
animals. 

Animal models for IGS4 disorders may include, for example, genetic models. Animal models 
exhibiting IGS4 based disorder-like symptoms may be engineered by utilizing, for example. IGS4 
10 sequences such as those described, above, in conjunction with techniques for producing transgenic 
animals that are well known to persons skilled in the art. For example. IGS4 sequences may be 
introduced into, and overexpressed and/or misexpressed in, the genome of the animal of interest, or, if 
endogenous IGS4 sequences are present, they may either be overexpressed, misexpressed, or, 
alternatively, may be disrupted in order to underexpress or inactivate IGS4 gene expression. 

15 

In order to overexpress or misexpress a IGS4 gene sequence, the coding portion of the IGS4 
gene sequence may be ligated to a regulatory sequence which Is capable of driving high level gene 
expression or expression in a cell type in which the gene is not normally expressed in the animal type of 
interest. Such regulatory regions will be well known to those skilled in the art, and may be utilized in the 
20 absence of undue experimentation. 

For underexpression of an endogenous IGS4 gene sequence, such a sequence may be isolated 
and engineered such that when reintroduced into the genome of the animal of interest, the endogenous 
IGS4 gene alleles will be inactivated, or "knocked-out". Preferably, the engineered IGS4 gene sequence 
25 is introduced via gene targeting such that the endogenous IGS4 sequence is disrupted upon integration 
of the engineered IGS4 gene sequence into the animal's genome. Gene targeting is discussed, below, 
in this section. 

Animals of any species, Including, but not limited to, mice, rats, rabbits, squirrels, guinea-pigs, 
30 pigs, micro-pigs, goats, and non-human primates, e.g., baboons, monkeys, and chimpanzees may be 
used to generate animal models of IGS4 related disorders. 

Any technique known in the art may be used to introduce a IGS4 transgene Into animals to 
produce the founder lines of transgenic animals. Such techniques include, but are not limited to 
35 pronuclear microinjection (Hoppe. P.C. and Wagner, T.E., 1989, U.S. Pat. No. 4,873.191); retrovirus 
mediated gene transfer into germ lines (van der Putten et al., Proc. Natl. Acad. Sci., USA 82:6148-6152, 
1985); gene targeting in embryonic stem cells (Thompson et a!.. Cell 56:313-321. 1989,); electroporation 
of embryos (Lo, Mol. Cell. Biol. 3:1803-1B14, 1983); and sperm-mediated gene transfer (Lavitrano et al.. 
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Cell 57:717-723. 1989); etc. For a review of such techniques, see Gordon, Transgenic Animals. Intl. 
Rev. Cytol.1 15:171-229, 1989. which is incorporated by reference herein in its entirety. 

The present invention provides for transgenic animals that carry the IGS4 transgene in all their 
5 cells, as well as animals which carry the transgene in some, but not all their cells, L^, mosaic animals. 
(See, for example, techniques described by Jakobovits, Curr. Biol. 4:761-763, 1994) The transgene may 
be integrated as a single transgene or in concatamers, e.g., head-to-head tandems or head-to-tail 
tandems. The transgene may also be selectively Introduced into and activated in a particular cell type by 
following, for example, the teaching of Lasko et al. (Lasko, M. et a!., Proc. Natl. Acad. Sci. USA 89:6232- 
10 6236, 1992). 

The regulatory sequences required for such a cell-type specific activation will depend upon the 
particular cell type of interest, and will be apparent to those of skill in the art. 

15 When it is desired that the IGS4 transgene be integrated into the chromosomal site of the 

endogenous IGS4 gene, gene targeting is preferred. Briefly, when such a technique is to be utilized, 
vectors containing some nucleotide sequences homologous to the endogenous IGS4 gene of interest 
(e.g., nucleotide sequences of the mouse IGS4 gene) are designed for the purpose of integrating, via 
homologous recombination with chromosomal sequences, into and disrupting the function of, the 

20 nucleotide sequence of the endogenous IGS4 gene or gene allele. The transgene may also be 
selectively introduced into a particular cell type, thus inactivating the endogenous gene of interest in only 
that cell type, by following, for example, the teaching of Gu et al. (Gu, H. et al.. Science 265:103-106, 
1994). The regulatory sequences required for such a cell-type specific inactivation will depend upon the 
particular cell type of interest, and will be apparent to those of skill in the art. 

25 

Once transgenic animals have been generated, the expression of the recombinant IGS4 gene 
and protein may be assayed utilizing standard techniques. Initial screening may be accomplished by 
Southern blot analysis or PGR techniques to analyze animal tissues to assay whether integration of the 
transgene has taken place. The level of mRNA expression of the IGS4 transgene in the tissues of the 

30 transgenic animals may also be assessed using techniques which include but are not limited to Northern 
blot analysis of tissue samples obtained from the animal, in situ hybridization analysis, and RT-PCR. 
Samples of target gene-expressing tissue, may also be evaluated immunocytochemically using 
antibodies specific for the target gene transgene product of interest. The IGS4 transgenic animals that 
express IGS4 gene mRNA or IGS4 transgene peptide (detected immunocytochemically, using 

35 antibodies directed against target gene product epitopes) at easily detectable levels may then be further 
evaluated to identify those animals which display characteristic IGS4 based disorder symptoms. 



Once IGS4 transgenic founder animals are produced (i.e. . those animals which express IGS4 
proteins in cells or tissues of interest, and which, preferably, exhibit symptoms of IGS4 based disorders). 
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they may be bred, inbred, outbred, or crossbred to produce colonies of the particular aninnal. Examples 
of such breeding strategies include but are not limited to: outbreeding of founder animals with more than 
one integration site in order to establish separate lines; inbreeding of separate lines in order to produce 
compound IGS4 transgenics that express the IGS4 transgene of interest at higher levels because of the 
5 effects of additive expression of each IGS4 transgene; crossing of heterozygous transgenic animals to 
produce animals homozygous for a given integration site in order to both augment expression and 
eliminate the possible need for screening of animals by DNA analysis; crossing of separate homozygous 
lines to produce compound heterozygous or homozygous lines; breeding animals to different inbred 
genetic backgrounds so as to examine effects of modifying alleles on expression of the IGS4 transgene 
10 and the development of IGS4-like symptoms. One such approach is to cross the IGS4 transgenic 
founder animals with a wild type strain to produce an F1 generation that exhibits 1GS4 related disorder- 
like symptoms, such as those described above. The F1 generation may then be inbred in order to 
develop a homozygous line, if it is found that homozygous target gene transgenic animals are viable. 

15 Vaccines 

Another aspect of the invention relates to a method for inducing an immunological response in a 
mammal which comprises administering to (for example by inoculation) the mammal the IGS4 
polypeptide, or a fragment thereof, if required together with a RAMP polypeptide, adequate to produce 

20 antibody and/or T cell immune response to protect said animal from PNS, psychiatric and CNS 
disorders, including schizophrenia, episodic paroxysmal anxiety (EPA) disorders such as obsessive 
compulsive disorder (OCD), post traumatic stress disorder (PTSD), phobia and panic, major depressive 
disorder, bipolar disorder, Parkinson's disease, general anxiety disorder, autism, delirium, multiple 
sclerosis, Alzheimer disease/dementia and other neurodegenerative diseases, severe mental 

25 retardation, dyskinesias, Huntington's disease, Tourett's syndrome, tics, tremor, dystonia, spasms, 
anorexia, bulimia, stroke, addiction/dependency/craving, sleep disorder, epilepsy, migraine; attention 
deficit/hyperactivity disorder (ADHD); cardiovascular diseases, including heart failure, angina pectoris, 
arrhythmias, myocardial infarction, cardiac hypertrophy, hypotension, hypertension - e.g. essential 
hypertension, renal hypertension, or pulmonary hypertension, thrombosis, arteriosclerosis, cerebral 

30 vasospasm, subarachnoid hemorrhage, cerebral ischemia, cerebral infarction, peripheral vascular 
disease, Raynaud's disease, kidney disease - e.g. renal failure; dyslipidemias; obesity; emesis; 
gastrointestinal disorders, including irritable bowel syndrome (IBS), inflammatory bowel disease (IBD), 
gastroesophagal reflux disease (GERD), motility disorders and conditions of delayed gastric emptying, 
such as post operative or diabetic gastroparesis. and diabetes, ulcers - e.g. gastric ulcer; diarrhoea; 

35 other diseases including osteoporosis; inflammations; infections such as bacterial, fungal, protozoan 
and viral infections, particularly infections caused by HIV-1 or HIV-2; pain; cancers; chemotherapy 
induced injury; tumor invasion; immune disorders; urinary retention; asthma; allergies; arthritis; benign 
prostatic hypertrophy; endotoxin shock; sepsis; complication of diabetes mellitus; and gynaecological 
disorders, among others. Yet another aspect of the invention relates to a method of inducing 
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immunological response in a mammal which comprises delivering the IGS4 polypeptide via a vector 
directing expression of the IGS4 polynucleotide in vivo in order to Induce such an immunological 
response to produce antibody to protect said animal from diseases. In particular the invention relates to 
a method for inducing an immunological response in a mammal which comprises inoculating the 
mammal with the IGS4 polypeptide, or a fragment thereof, if required together with a RAMP polypeptide, 
adequate to produce antibody and/or T cell immune response to protect said animal from disorders of 
the nervous system, including the central nervous system (CNS) and the peripheral nervous system 
(PNS), disorders of the gastrointestinal system and/or of the cardiovascular system and/or of skeletal 
muscle and/or of the thyroid, and/or also from lung diseases, immunological diseases and disorders of 
the genitourinary system. 

A further aspect of the invention relates to an immunologicai/vaccine formulation (composition) 
which, when introduced into a mammalian host, induces an immunological response in that mammal to 
an IGS4 polypeptide wherein the composition comprises an IGS4 polypeptide or IGS4 gene. Such 
Immunologicai/vaccine formulations (compositions) may be either therapeutic immunologlcal/vaccine 
formulations or prophylactic immunologicai/vaccine formulations. The vaccine formulation may further 
comprise a suitable carrier. Since the IGS4 polypeptide may be broken down in the stomach, it is 
preferably administered parenterally (including subcutaneous, intramuscular, intravenous, intradermal 
etc. injection). Formulations suitable for parenteral administration include aqueous and non-aqueous 
sterile injection solutions which may contain anti-oxidants, buffers, bacteriostats and solutes which 
render the formulation isotonic with the blood of the recipient; and aqueous and non-aqueous sterile 
suspensions which may include suspending agents or thickening agents. The formulations may be 
presented in unit-dose or multi-dose containers, for example, sealed ampoules and vials and may be 
stored in a freeze-dried condition requiring only the addition of the sterile liquid carrier immediately prior 
to use. The vaccine formulation may also include adjuvant systems for enhancing the immunogenicity of 
the formulation, such as oil-in water systems and other systems known In the art. The dosage will 
depend on the specific activity of the vaccine and can be readily determined by routine experimentation. 

Screening Assays 

The IGS4 polypeptide of the present invention may be employed in a screening process for 
compounds which bind the receptor and which activate (agonists) or inhibit activation of (antagonists) 
the receptor polypeptide of the present invention. Thus, polypeptides of the invention may also be used 
to assess the binding of small molecule substrates and ligands in, for example, ceils, cell-free 
preparations, chemical libraries, and natural product mixtures. These substrates and ligands may be 
natural substrates and ligands or may be structural or functional mimetics. 

IGS4 polypeptides are responsible for biological functions, including pathologies. Accordingly, it is 
desirable to find compounds and drugs which stimulate 1GS4 on the one hand and which can inhibit the 
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function of IGS4 on the other hand. In general, agonists are employed for therapeutic and prophylactic 
purposes for such conditions as PNS, psychiatric and CNS disorders. Including schizophrenia, episodic 
paroxysmal anxiety (EPA) disorders such as obsessive compulsive disorder (OCD), post traumatic 
stress disorder (PTSD), phobia and panic, major depressive disorder, bipolar disorder, Parkinson's 
5 disease, general anxiety disorder, autism, delirium, multiple sclerosis, Alzheimer disease/dementia and 
other neurodegenerative diseases, severe mental retardation, dyskinesias, Huntington's disease, 
Touretfs syndrome, tics. tremor. dystonia, spasms, anorexia. bulimia, stroke, 
addiction/dependency/craving, sleep disorder, epilepsy, migraine; attention deficit/hyperactivity disorder 
(ADHD); cardiovascular diseases, including heart failure, angina pectoris, arrhythmias, myocardial 

10 infarction, cardiac hypertrophy, hypotension, hypertension - e.g. essential hypertension, renal 
hypertension, or pulmonary hypertension, thrombosis, arteriosclerosis, cerebral vasospasm, 
subarachnoid hemorrhage, cerebral ischemia, cerebral infarction, peripheral vascular disease, 
Raynaud's disease, kidney disease - e.g. renal failure; dyslipidemias; obesity; emesis; gastrointestinal 
disorders, including irritable bowel syndrome (IBS), inflammatory bowel disease (IBD), gastroesophagal 

15 reflux disease (GERD), motility disorders and conditions of delayed gastric emptying, such as post 
operative or diabetic gastroparesis, and diabetes, ulcers - e.g. gastric ulcer; diarrhoea; other diseases 
including osteoporosis; inflammations; infections such as bacterial, fungal, protozoan and viral 
infections, particularly infections caused by HIV-1 or HIV-2; pain; cancers; chemotherapy induced injury; 
tumor invasion; immune disorders; urinary retention; asthma; allergies; arthritis; benign prostatic 

20 hypertrophy; endotoxin shock; sepsis; complication of diabetes mellitus; and gynaecological disorders. 
Antagonists may be employed for a variety of therapeutic and prophylactic purposes for such conditions 
as PNS, psychiatric and CNS disorders, including schizophrenia, episodic paroxysmal anxiety (EPA) 
disorders such as obsessive compulsive disorder (OCD). post traumatic stress disorder (PTSD), phobia 
and panic, major depressive disorder, bipolar disorder, Parkinson's disease, general anxiety disorder. 

25 autism, delirium, multiple sclerosis, Alzheimer disease/dementia and other neurodegenerative diseases, 
severe mental retardation, dyskinesias, Huntington's disease, Tourett's syndrome, tics, tremor, dystonia, 
spasms, anorexia, bulimia, stroke, addiction/dependency/craving, sleep disorder, epilepsy, migraine; 
attention deficit/hyperactivity disorder (ADHD); cardiovascular diseases, including heart failure, angina 
pectoris, arrhythmias, myocardial infarction, cardiac hypertrophy, hypotension, hypertension - e.g. 

30 essential hypertension, renal hypertension, or pulmonary hypertension, thrombosis, arteriosclerosis, 
cerebral vasospasm, subarachnoid hemorrhage, cerebral ischemia, cerebral infarction, peripheral 
vascular disease, Raynaud's disease, kidney disease - e.g. renal failure; dyslipidemias; obesity; 
emesis; gastrointestinal disorders, including irritable bowel syndrome (IBS), inflammatory bowel disease 
(IBD), gastroesophagal reflux disease (GERD), motility disorders and conditions of delayed gastric 

35 emptying, such as post operative or diabetic gastroparesis, and diabetes, ulcers - e.g. gastric ulcer; 
diarrhoea; other diseases including osteoporosis; inflammations; infections such as bacterial, fungal, 
protozoan and viral infections, particulariy infections caused by HIV-1 or HIV-2; pain; cancers; 
chemotherapy induced injury; tumor invasion; immune disorders; urinary retention; asthma; allergies; 
arthritis; benign prostatic hypertrophy; endotoxin shock; sepsis; complication of diabetes mellitus; and 
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gynaecological disorders. Particularly, tlie present invention may be employed in a screening process 
for compounds wliich bind the receptor and which activate (agonists) or inhibit activation of (antagonists) 
the IGS4 neuromedin receptor protein, preferably the mammalian IGS4 neuromedin receptor protein, 
said protein exhibiting high affinity binding for neuromedin U, preferably for neuromedin U-8. for 
5 neuromedin U-23 and/or for neuromedin U-25. These screening assays are particularly suitable for 
screening compounds which are effective with regard to disorders of the nervous system, including the 
central nervous system (CNS) and the peripheral nervous system (PNS). disorders of the 
gastrointestinal system and/or of the cardiovascular system and/or of skeletal muscle and/or of the 
thyroid, and/or also to lung diseases, immunological diseases and disorders of the genitourinary system. 

10 

In general, such screening procedures Involve producing appropriate cells, which express the 
receptor polypeptide of the present invention on the surface thereof and, if essential co-expression of 
RAMP'S at the surface thereof. Such cells include cells from mammals, yeast, Drosophila or E. coli. 
Cells expressing the receptor (or cell membrane containing the expressed receptor) are then contacted 
1 5 with a test compound to obsen/e binding, or stimulation or inhibition of a functional response. 

One screening technique includes the use of cells which express the receptor of this invention (for 
example, transfected CHO cells) in a system which measures extracellular pH, intracellular pH, or 
intracellular calcium changes caused by receptor activation. In this technique, compounds may be 
20 contacted with cells expressing the receptor polypeptide of the present invention. A second messenger 
response, e.g., signal transduction, pH changes, or changes in calcium level, is then measured to 
determine whether the potential compound activates or inhibits the receptor. 

Another method involves screening for receptor inhibitors by determining modulation of a 
25 receptor-mediated signal, such as cAMP accumulation and/or adenylate cyclase activity. Such a method 
involves transfecting an eukaryotic cell with the receptor of this invention to express the receptor on the 
cell surface. The cell is then exposed to an agonist to the receptor of this invention in the presence of a 
potential antagonist. If the potential antagonist binds the receptor, and thus inhibits receptor binding, the 
agonist-mediated signal will be modulated. 

30 

Another method for detecting agonists or antagonists for the receptor of the present invention is 
the yeast-based technology as described in U.S. Patent 5.482,835, incorporated by reference herein. 

The assays may simply test binding of a candidate compound wherein adherence to the cells 
35 bearing the receptor is detected by means of a label directly or indirectly associated with the candidate 
compound or in an assay involving competition with a labeled competitor. Further, these assays may 
test whether the candidate compound results in a signal generated by activation of the receptor, using 
detection systems appropriate to the cells bearing the receptor at their surfaces. Inhibitors of activation 
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are generally assayed in the presence of a known agonist and the effect on activation by the agonist by 
the presence of the candidate compound is observed. 

Thus candidate compounds may be screened which show ligand binding to the IGS4 receptors of 
5 the present invention. In the context of the present invention the term "ligand binding" is understood as 
to describe compounds with affinity to the IGS4 receptors showing log EC50 values of at least below 
-6.00 (approx. 660 nM), preferably log EC50 below -7.00 (approx. 55 nM). more preferably log ECgo 
below -9.00 (approx. 500 pM to 1.2 nM). and most preferably log EC50 below -10.00 (approx. 60-100 
pM). 

10 

Thus in one aspect the invention concerns a method of determining whether a substance is a 
potential ligand of IGS4 receptor comprising 

(a) contacting cells expressing one of the IGS4 neuromedin receptors defined supra or one of the 
receptors of SEQ ID NO:2. SEQ ID NO:4. SEQ ID NO:6 and SEQ ID NO:8. or contacting a receptor 

1 5 membrane preparation comprising one of said IGS4 neuromedin receptors defined supra or one of the 
receptors of SEQ ID NO:2. SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8 with labeled neuromedin U in 
the presence and in the absence of the substance; and 

(b) measuring the binding of neuromedin U to IGS4. 

20 Further, the assays may simply comprise the steps of mixing a candidate compound with a 

solution containing an IGS4 polypeptide to form a mixture, measuring the IGS4 activity in the mixture, 
and comparing the IGS4 activity of the mixture to a standard. . 

The IGS4 cDNA, protein and antibodies to the protein may also be used to configure assays for 
25 detecting the effect of added compounds on the production of IGS4 mRNA and protein in cells. For 
example, an ELISA may be constructed for measuring secreted or cell associated levels of IGS4 protein 
using monoclonal and polyclonal antibodies by standard methods known in the art, and this can be used 
to discover agents which may inhibit or enhance the production of IGS4 (also called antagonist or 
agonist, respectively) from suitably manipulated cells or tissues. Standard methods for conducting 
30 screening assays are well known in the art. 

Examples of potential IGS4 antagonists include antibodies or, in some cases, oligonucleotides or 
proteins which are closely related to the ligand of the IGS4, e.g., a fragment of the ligand, or small 
molecules which bind to the receptor but do not elicit a response, so that the activity of the receptor is 
35 prevented. 

Thus in another aspect, the present invention relates to a screening kit for Identifying agonists, 
antagonists, ligands. receptors, substrates, enzymes, etc. for IGS4 polypeptides; or compounds which 
decrease, increase and/or otherwise enhance the production of IGS4 polypeptides, which comprises: 
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(a) an IGS4 polypeptide, preferably that of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ 
ID NO: 8; 

(b) a recombinant cell expressing an IGS4 polypeptide, preferably that of SEQ ID NO: 2, SEQ ID 
NO: 4. SEQ ID NO: 6 or SEQ ID NO: 8; 

5 (c) a cell membrane expressing an IGS4 polypeptide; preferably that of SEQ ID NO: 2, SEQ ID 

NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8; or 
(d) antibody to an IGS4 polypeptide, preferably that of SEQ ID NO: 2. SEQ ID NO: 4, SEQ ID 
NO: 6 or SEQ ID NO: 8. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial component. 

10 

Prophylactic and Therapeutic Methods 

This invention provides methods of treating abnormal conditions related to both an excess of and 
insufficient amounts of IGS4 activity. 

15 

If the activity of IGS4 is in excess, several approaches are available. One approach comprises 
administering to a subject an Inhibitor compound (antagonist) as hereinabove described along with a 
pharmaceutically acceptable carrier in an amount effective to inhibit activation by blocking binding of 
ligands to the IGS4, or by inhibiting interaction with a RAMP polypeptide or a second signal, and thereby 
20 alleviating the abnormal condition. 

In another approach, soluble forms of IGS4 polypeptides still capable of binding the ligand in 
competition with endogenous IGS4 may be administered. Typical embodiments of such competitors 
comprise fragments of the IGS4 polypeptide. 

25 

In still another approach, expression of the gene encoding endogenous IGS4 can be inhibited 
using expression-blocking techniques. Known such techniques involve the use of antisense sequences, 
either internally generated or separately administered. See, for example, O'Connor, J Neurochem 
(1991) 56:560 in Oligodeoxynudeotides as Antisense Inhibitors of Gene Expression. CRC Press, Boca 

30 Raton. Florida USA (1988). Alternatively, oligonucleotides, which form triple helices with the gene, can 
be supplied. See, for example, Lee et al.. Nucleic Acids Res (1979) 6:3073; Cooney et al.. Science 
(1988) 241:456; Dervan et al. Science (1991) 251:1360. These oligomers can be administered per se or 
the relevant oligomers can be expressed in vivo. Synthetic antisense or triplex oligonucleotides may 
comprise modified bases or modified backbones. Examples of the latter include methylphosphonate, 

35 phosphorothioate or peptide nucleic acid backbones. Such backbones are incorporated in the antisense 
or triplex oligonucleotide in order to provide protection from degradation by nucleases and are well 
known in the art. Antisense and triplex molecules synthesized with these or other modified backbones 
also form part of the present Invention. 
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In addition, expression of the IGS1 polypeptide may be prevented by using ribozymes specific to 
the IGS1 mRNA sequence. Ribozymes are catalytically active RNAs that can be natural or synthetic 
(see for example Usman. N, et a!., Cun-. Opin. Struct. Biol (1996) 6(4), 527-33,) Synthetic ribozymes can 
be designed to specifically cleave IGS1 mRNAs at selected positions thereby preventing translation of 
5 the IGS1 mRNAs into functional polypeptide. Ribozymes may be synthesized with a natural ribose 
phosphate backbone and natural bases, as normally found In RNA molecules. Alternatively the 
ribosymes may be synthesized with non-natural backbones to provide protection from ribonuclease 
degradation, for example, 2-O-methyl RNA, and may contain modified bases. 

10 For treating abnormal conditions related to an under-expresslon of IGS4 and its activity, several 

approaches are also available. One approach comprises administering to a subject a therapeutically 
effective amount of a compound which activates IGS4, i.e., an agonist as described above, in 
combination with a pharmaceutically acceptable carrier, to thereby alleviate the abnormal condition. 
Alternatively, gene therapy may be employed to effect the endogenous production of IGS4 by the 

15 relevant cells in the subject. For example, a polynucleotide of the invention may be engineered for 
expression in a replication defective retroviral vector, as discussed above. The retroviral expression 
construct may then be isolated and introduced into a packaging cell transduced with a retroviral plasmid 
vector containing RNA encoding a polypeptide of the present invention such that the packaging cell now 
produces infectious viral particles containing the gene of interest. These producer cells may be 

20 administered to a subject for engineering cells in vivo and expression of the polypeptide in vivo. For 
overview of gene therapy, see Chapter 20, Gene Therapy and other Molecular Genetic-based 
Therapeutic Approaches, (and references cited therein) in Human Molecular Genetics, Strachan T. and 
Read A.P., BIOS Scientific Publishers Ltd (1996). 

25 Any of the therapeutic methods described above may be applied to any subject in need of such 

therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits, monkeys, and 
most preferably, humans. 

Formuiation and Administration 

30 

Peptides, such as the soluble form of IGS4 polypeptides, and agonists and antagonist peptides or 
small molecules, may be formulated in combination with a suitable pharmaceutical carrier. Such 
formulations comprise a therapeutically effective amount of the polypeptide or compound, and a 
pharmaceutically acceptable carrier or excipient. Formulation should suit the mode of administration, 
35 and is well within the skill of the art. The invention further relates to pharmaceutical packs and kits 
comprising one or more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. 
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Polypeptides and other compounds of the present invention may be employed alone or in 
conjunction with other compounds, such as therapeutic compounds. 

Preferred forms of systemic administration of the pharmaceutical compositions include injection, 
typically by Intravenous Injection. Other injection routes, such as subcutaneous, intramuscular, or 
intraperitoneal, can be used. Alternative means for systemic administration include transmucosal and 
transdermal administration using penetrants such as bile salts or fusidic acids or other detergents. In 
addition, if properly formulated in enteric or encapsulated formulations, oral administration may also be 
possible. 

The dosage range required depends on the choice of peptide or compound, the route of 
administration, the nature of the formulation, the nature of the subject's condition, and the judgment of 
the attending practitioner. Suitable dosages are in the range of 0.1-100 ^g/kg of subject. Wide variations 
in the needed dosage, however, are to be expected in view of the variety of compounds available and 
the differing efficiencies of various routes of administration. For example, oral administration would be 
expected to require higher dosages than administration by intravenous injection. Variations in these 
dosage levels can be adjusted using standard empirical routines for optimization, as is well understood 
in the art. 

Polypeptides used in treatment can also be generated endogenously in the subject, in treatment 
modalities often referred to as "gene therapy" as described above. Thus, for example, cells from a 
subject may be engineered with a polynucleotide, such as a DNA or RNA, to encode a polypeptide ex 
vivo, and for example, by the use of a retroviral plasmid vector. The cells are then introduced into the 
subject. 

The following examples are only intended to further illustrate the invention in more detail, and 
therefore these examples are not deemed to restrict the scope of the invention in any way. 
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EXAMPLE 1. THE CLONING OF CDNA ENCODING A NOVEL G PROTEIN-COUPLED RECEPTOR. 

Example la. Homology PCR cloning of a genomic fragment encoding a novel G-protein coupled 
receptor (GPCR). 

5 

A PCR based homology cloning strategy was used to isolate partial genomic DNA sequences 
encoding novel G-protein coupled receptors. Forward (F22) and reverse (R44 and R46) degenerate 
PCR primers were designed in conserved areas of the neurotensin receptor gene family (Vita N. et al. 
[1993] Febs Lett. 317, 139-142; Vita N. et al. [1998] Eur. J. Pharmacol. 360, 265-272) within 
10 transmembrane domains 1 (TM1) and 3 (TM3) and at the boundary between TM3 and Intracellular loop 
2 (12): 



15 



F22 (TM1): 

5'-CTCATCTTCGCGGTGGGC(A or G)C(A,C,G or T)G(C or T)(A.C.G or T)GG-3' (SEQ ID NO: 13) 
R44 (TM3/I2): 

5'-GGCCAGGCAGCGCTCCGCGCT(C or lnosine)A(A or G)(A,C.G or T)C(C or T)(A.C.G or T)GC(A.G 
orT)-3' (SEQ ID NO: 14) 

20 R46 (TM3): 

5 -GAA(A or G)TA(A or G)TAGCC(A or G)CG(A or G)CAGCC(A or T)-3' (SEQ ID NO: 1 5) 

In order to suppress amplification of known members of the neurotensin receptor family, the 3' 
ultimate nucleotide position of primer R44 was chosen in such a way that is was not complementary to 

25 the corresponding position of both NTR1 and NTR2 cDNA. The primary PCR reaction was carried out in 
a eopl volume and contained lOOng human genomic DNA (Clontech), 6 pi GeneAmp™ 10 x PCR buffer 
II (lOOmM Tris-HCI pH 8.3; 500 mM KCI, Parkin Elmer), 3.6 pi 25 mM MgClz. 0.36 pi dNTPs (25mM of 
each dNTP). 1.5 units AmpliTaq-Gold™ polymerase (Perkin Elmer) and 30 pmoles of each of the 
degenerated fonvard (F22) and reverse primer (R44). Reaction tubes were heated at 95*^0 for 10 min 

30 and then subjected to 35 cycles of denaturation (95*^0, 1 min), annealing (55**C, 2 min) and extension 
(72**C, 3min). Finally reaction tubes were heated for 10 min at 72X. 

For the semi-nested PCR reaction 1 pi of a 1/50 dilution of the primary PCR reaction was used 
as a template using the degenerate forward and reverse primers F22 and R46 respectively. The semi- 
nested PCR reaction was carried out under the same conditions as the primary PCR reaction. 

35 Semi-nested PCR reaction products were size fractionated on an agarose gel and stained with 

ethidium bromide. Although a fragment of ± 220 bp was expected, only a fragment of ± 120 bp was 
visible. This fragment was purified from gel using the Qiaex-ll™ purification kit (QIagen) and ligated into 
the pGEM-T plasmid according to the procedure recommended by the supplier (pGEM-T kit, Promega). 
The recombinant plasmids thus produced were used to transform competent E. coll SURE™ 2 bacteria 

40 (Stratagene). Transfonmed cells were plated on LB agar plates containing ampicillin (100 \JQ/m\), IPTG 
(0.5 mM) and X-gal (50 MQ/ml). Plasmid DNA was purified from mini-cultures of individual colonies using 
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a Qiagen-tip 20 miniprep kit (Qiagen). DNA sequencing reactions were carried out on the purified 
plasmid DNA with the ABI Prism™ BigDye™ Terminator Cycle Sequencing Ready Reaction kit (PE-ABI), 
using insert-flanking primers. 



Table 7: Overview of ollgo primers used. 



SEQ ID NO: 13 


F22: 5-CTCATCTTCGCGGTGGGC(A or G)C(A,C.G or T)G(C or T)(A,C,G or 
T)GG-3' 


SEQ ID NO: 14 


R44: 5-GGCCAGGCAGCGCTCCGCGCT(C or lnosine)A(A or G)(A.G,G or T)C(C 
or T)(A,C,G or T)GC(A.G or T)-3' 


SEQ ID NO: 15 


R46: 5'-GAA(A or G)TA(A or G)TAGCC(A or G)CG(A or G)CAGCC(A or T)-3' 


SEQ ID NO: 16 


API : 5 -CCATCCTAATACGACTCACTATAGGGC-3' 


SEQ ID NO: 17 


AP2: 5'-ACTCACTATAGGGCTCGAGCGGC-3* 


SEQ ID NO: 18 


IGS4R1 : 5"GGATCCCAAATAAGAAAGGGTAGTTGC-3' 


SEQ ID NO: 19 


IGS4R2: 5'AAAGGGTAGTTGGGCCACATCTCATAGAC-3' 


SEQ ID NO: 20 


IGS4F5: 5'AGGTGTATGAGATGTGGCGCAACTACCCT-3' 


SEQ ID NO: 21 


IGS4F6: 5*ATGTGGCGCAACTACCCTTTCTTATTTGGG-3' 


SEQ ID NO: 22 


R74: 5 -CGGAAGTTGGCGGACACG(A or G)(A,C or G)(A or G)TT(A or G)TA-3' 


SEQ ID NO: 23 


IGS4F7: 5'-GCTCAGGTTGAAACAGAGCCTCGTACC-3' 


SEQ ID NO: 24 


1 GS4F8: S'-CC ATGTGG ATCTACAATTTCATCATCC-3' 


SEQ ID NO: 25 


IGS4F9: 5-AAGACAAATCTCTTGAGGCAGATGAAGGG-3* 


SEQ ID NO: 26 


IGS4F1 0: 5*-GATGCTGTTTGTCTTGGTCTTAGTGTTTGC-3' 


SEQ ID NO: 27 


IGS4R5: 5'-GGATGATGAAATTGTAGATCCACATGGGC-3' 


SEQ ID NO: 28 


IGS4R6: S'-TGTGGAGAAGTCTCTCAAAGTGTGG-S* 


SEQ ID NO: 29 


IGS4R7: 5-TAGTAGGAGTGACAGCCTGACTCGGAACG-3* 


SEQ ID NO: 30 


IGS4R8 : 5'-AACGTAG ATGACTC AGGAGG AACCATTTCC-3' 


SEQ ID NO: 31 


IGS4F1 1 : 5'-TCGTACCAGGGGAGGCTCAGGC-3' 



Sequencing reaction products were purified via EtOH/NaOAc precipitation and analyzed on an ABI 377 
automated sequencer. 

Sequence analysis of the insert of clone HNT1552 showed that it potentially encoded part of a novel 
member of the GPCR family. We refer to this novel GPCR sequence as IGS4. 

Example 1b. Cloning of cDNA fragments containing the complete IGS4 coding sequence. 

The complete coding sequence of IGS4 cDNA was obtained via both RACE analysis (rapid 
amplification of cDNA ends) and RT-PCR amplification. 5'- and 3' RACE PGR reactions were performed 
on Marathon-Ready'^'^ cDNA from human brain or testis (Clontech n*' 7400-1 and 7414-1 respectively), 
using the adaptor primers 1 and 2 (AP1: SEQ ID NO: 16 ; AP2: SEQ ID NO: 17) provided with the 
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WO 01/25269 PCT/EPOO/09584 

-41 - 

Marathon™ cDNA amplification kit (Clontech K1 802-1) and IGS4 specific primers. PGR RACE reactions 
were performed according to the Instructions of the Marathon-Ready"^" cDNA user manual provided by 
Clontech. RACE products were separated on agarose gel. visualized with ethidium bromide and blotted 
onto Hybond N* membranes. Blots were prehybridized at es'^C for 2 h in modified Church buffer (0.5M 
5 phosphate, 7% SDS, 10 mM EDTA) and then hybridised overnight at 65**C in the same buffer containing 
2x10^ cpm / ml of a ^^P-labelled IGS4 cDNA probe. IGS4 cDNA probes were radiolabelled via random 
primed incorporation of [oL-^^P]dCTP to a specific activity of > 10® cpm/pg using the Prime-It II kit™ 
(Stratagene) according to the instructions provided by the supplier. Hybridized blots were washed at 
high stringency (2 x 30 min at room temperature in 2 x SSC/ 0.1% SDS. followed by 2 washes of 40 min 

10 at 65**C in 0.1 x SSC, 0.1% SDS) and autoradiographed overnight. Hybridizing fragments were purified 
from a preparative gel, cloned into the pGEM-T vector and sequenced as described above. 

An initial round of semi-nested 5* RACE analysis on human brain cDNA using the IGS4 specific 
primers IGS4R1 (SEQ ID NO: 18) and IGS4R2 (SEQ ID NO: 19)(designed on the DNA sequence of 
clone HNT1552) yielded clones HNT1886 and HNT1887 (Fig.1). These clones extended the IGS4 cDNA 

15 sequence upstream up to and beyond the putative start of translation codon. Likewise an initial round of 
3' RACE analysis on human brain cDNA using IGS4 specific primers IGS4F5 (SEQ ID NO: 20) and 
IGS4F6 (SEQ ID NO: 21) yielded clones HNT1 874-1 878 and HNT1 902-1 903 (Fig.1). These clones 
extended the known IGS4 cDNA at the 3' end. 

All sequences obtained at this point were assembled into a single contig which contained a long 

20 open reading frame, encoding part of a novel protein that was most similar to human orphan receptor 
FM-3 (Tan et al.. Genomics 52, 223-229 [1998], GenBank accession n"* AF044600 and AF044601). To 
investigate the RNA expression profile of IGS4, a Master Blot^"^ membrane (Clontech cat n° 7770-1 ) 
containing RNA from different human tissues was hybridized to the ^^P-labelled insert of clone HNT1903 
under the conditions recommended by the supplier. The strongest hybridization was obtained with testis 

25 RNA whereas much weaker signals were obtained in prostate, stomach, spina! cord, hippocampus, 
medulla oblongata, thyroid gland, thymus, lung and trachea. 

Since the contig sequence did not yet contain the complete IGS4 coding sequence we set up an 
RT-PCR homology cloning experiment on human total brain RNA using IGS4 specific primer IGS4F6 
(SEQ ID NO: 21) and a degenerated primer (R74, SEQ ID NO: 22), which was designed in a conserved 

30 area (at the TM7/C-terminal intracellular part) of the GPCR subfamily composed of the neurotensin 
receptors 1 and 2, the growth hormone secretagogue receptor (Howard A.D. et a!.[1996] Science 273, 
974-977) and the orphan GPCR FM-3 and GPR38 (McKee K.K. et al.[1997] Genomics 46. 426-434), 
RT-PCR reactions were carried out in a 50 pi volume on 500 ng total RNA from human brain using the 
Titan"^" One Tube RT-PCR System (Boehringer catalogue n° 1,888.382) according to the 

35 recommendations of the supplier. Briefly. RT-PCR conditions were as follows: reverse transcription for 
45 min at 55°C; 2 min denaturation at 94''C, followed by a touch-down PCR reaction of 20 cycles (30 
sec denaturation at 94X, 30 sec annealing at 60**C t-0.25**C/cycle] and 2 min extension at 68**C) and an 
additional round of 30 PCR cycles (30 sec denaturation at 94X, 30 sec annealing at 55X and 3 min [+ 
5 sec/cycle] extension at 68**C). This was concluded with an extra extension step of 7 min at eS'^C. 
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Reaction products were analyzed via Southern blotting using the radiolabelled insert of clone HNT1903. 
A fragment of ± 690 bp that hybridized to the probe was purified from the gel (Qiaexir", Qiagen) and 
cloned into the pGEM-T vector yielding clones HNT2210'2212. Sequence analysis of these clones 
allowed to extend the existing IGS4 cDNA contig in the 3' direction. 
5 Since the extended IGS4 cDNA contig still did not yet contain a translatlonal stop codon, 

additional IGS4 specific 3' RACE primers were designed (IGS4F7-10, SEQ ID NO: 23-26)). Nested or 
semi-nested 3' RACE reactions were carried out on Marathon Ready™ cDNA from human testis 
(Clontech 7414-1). IGS4 specific bands (as assessed via Southern blot analysis using an IGS4 specific 
probe) were cloned into pGEM-T. This yielded clones HNT2289-90 {AP1/IGS4F5->AP2/IGS4F9), 

10 HNT2293-2295 {AP1/IGS4F6->AP2/IGS4F9), HNT2296-2297 (AP1/IGS4F7->AP2/IGS4F9), HNT2308- 
2310 (AP1/IGS4F8->AP2/IGS4F10) HNT2253 (AP1/IGS4F7->AP1/IGS4F5). An additional 5' RACE 
PGR reaction carried out on testis Marathon Ready™ cDNA yielded clones HNT2279-2281 
(AP2/IGS4R6->AP2/IGS4R5). (note: AP1/IGS4F5->AP2/IGS4F9 e.g. Indicates that clones were 
generated from an IGS4 specific fragment obtained after the primary RACE PCR reaction [using primer 

15 pair AP1/IGS4F5] was nested with primer pair AP2/IGS4F9). 

Sequence analysis of these clones allowed to extend the existing IGS4 cDNA contig further in the 3' 
direction although the end of the IGS4 coding sequence was not yet been reached. A computer-assisted 
homology search (Blastn; Altschul S.F. et al.. Nucleic Acids Res. (1997) 25:3389-3402) of the IGS4 
contig DNA sequence against the expressed sequence tag (EST) database (dbest) showed the 

20 presence of an EST sequence (accession n** N45474) which overlapped with the 3' end of the IGS4 
contig (near 100 % identity in the overlap area). EST N45474 further extended the IGS4 DNA contig at 
the 3' end into a translatlonal stop codon and into the 3' untranslated region (3'-UTR). In addition 
another set of ESTs was identified which all covered the 3'-UTR of the IGS4 mRNA (Fig.2). Additional 
IGS4 specific primers (IGS4R7-8, SEQ ID NO: 29-30)) were designed within the 3*-UTR of these ESTs. 

25 Primary PCR reactions were carried out on Marathon Ready™ cDNA from human testis using various 
combinations of the IGS4F7 (SEQ ID NO: 23). IGS4F11 (SEQ ID NO: 31) and IGS4R7-8 (SEQ ID NO: 
29-30) primers. PCR tubes were heated for 2 min at 95**C and then subjected to 35 cycles of 
denaturation (95°C. 30 sec), annealing (65°C. 30 sec) and extension (72X, 1 mIn 30 sec). Finally the 
reactions tubes were heated at 72°C for 10 min. Nested PCR reactions were also carried out under the 

30 same conditions. DNA fragments of ± 1630 bp were purified from gel and cloned into the pGEM-T 
vector. The following clones were obtained: HNT2311. HNT2312 and HNT2317 (IGS4F7/IGS4R7- 
>IGS4F11/IGS4R8); HNT2313, HNT2324. HNT2326 and HNT2328 (IGS4F11->IGS4R8); HNT2314, 
HNT2315 and HNT2322 (IGS4F1 1->R7). Clone HNT2363 was obtained from a purified 1630 bp PCR 
fragment, that was amplified from human testis Marathon Ready™ cDNA using the IGS4F11/R7 primer 

35 pair under the following slightly modified conditions. After an initial denaturation of 2 min at 94°C. PCR 
tubes were subjected to 15 cycles of denaturation [15 sec. 94°C]. annealing [30 sec. 65°C] and 
extension [2 min, 72°C] followed by another 20 cycles of denaturation [15 sec, 94*'C], annealing [30 sec. 
65°C] and extension [ 2 min, 72°C; +10sec/cycle]. There was a final extension step of 7 min at 72°C. 
Sequence analysis of these clones allowed to assemble an IGS4 cDNA consensus sequence (Fig.1). 
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Close inspection of all clones showed that they actually were of 2 sequence types, which differed at 5 
nucleotide positions. These variant sequences correspond to a polymorphism within the human 
population. We refer to these different cDNA types as IGS4ADNA (SEQ ID NO: 1 and SEQ ID NO: 3) 
and IGS4BDNA (SEQ ID NO: 5 and SEQ ID NO: 7). The consensus sequence contained a long open 
reading frame that contained two in-frame start codons (positions 55-57 (SEQ ID NO: 1 and SEQ ID 
NO: 5) and 64-66 (SEQ ID NO: 3 and SEQ ID NO: 7) in IGS4ADNA and IGS4BDNA). predicting a 
protein of either 415 (SEQ ID NO: 2 and SEQ ID NO: 6) or 412 (SEQ ID NO: 4 and SEQ ID NO: 8) 
amino acids, which showed good homology to GPCR proteins. Hydropathy analysis (Kyte J. et al.[1982] 
J. Mol. Biol. 157: 105-132; Klein P.et al.[1985] Biochim. Biophys. Acta 815:468-476) of the protein also 
indicated the presence of 7 transmembrane domains. Since the first ATG initiator codon Is within a weak 
"Kozak" translation initiation context and the second one is in a strong Kozak context. It is likely that the 
IGS4A/B protein starts at the second methionine and is 412 amino acids long (Kozak M. [1999] Gene 
234, 187-208). However some (or even exiusive) initiation at the first ATG cannot be excluded. Among 
the five polymorphic nucleotides, four (positions 947. 999, 1202 and 1216 in IGS4A/BDNA) resulted in a 
switch in the encoded amino acid residue, whereas the fifth (pos 1381 in IGS4A/BDNA) was within the 
3"-UTR. The respective predicted protein sequences are referred to as IGS4APROT (SEQ ID NO: 2 and 
SEQ ID NO: 4) and IGS4BPROT (SEQ ID NO: 6 and SEQ ID NO: 8). (note 1: the sequence of 
IGS4APROT and IGS4BPROT in this document is represented as the longest possible (415 amino 
acids) sequence but it is understood that the actual protein might be 3 amino acids shorter at the amino- 
terminus; for this reason the first 3 amino acids of IGS4APROT and IGS4BPROT In Table 4 and 5 have 
been bracketed) (note 2: In this document IGS4 refers to the IGS4 sequence in general. Irrespective of 
the particular allelic type). Homology searches of the IGS4 protein sequence against public domain 
protein databanks showed best homology to the human orphan GPCR FM-3 (accession no 043664, 
Tan CP., et al. Genomics (1998) 52: 223-229; 46% identity in IGS4A amino acid residues 26-342). 



Homology searches of DNA databanks with the IGS4 cDNA sequence yielded a number of entries 
which were also derived from the IGS4 gene locus (Flg.2 for overview): 

• 10 EST sequence entries (accession nrs W61 169. AI432384, W61131 , AI023570. F01358. F03770. 
Z38158. R40869. R37725. H11333). 2 STS (sequence tagged sites) (accession nrs G20615 and 
G05725) and one genomic sequence (accession nr AQ078563) were discovered which were all 
derived from the 3'-UTR of IGS4 cDNA. 

• EST accession n° N45474 encoded the 3' end of the IGS4 coding sequence and part of the 3' UTR 
(cfr supra). 

• A 'working draft' high throughput genomic sequence (accession nr AG008571 , version AC008571 .1 , 
deposited 3 AUG 1999), which consisted of 42 unordered contigs assembled in an arbitrary order 
was discovered in which we detected the entire IGS4 cDNA sequence in 4 separate areas. These 
areas most likely correspond to the different IGS4 exons as they were flanked by canonical splice 
donor and acceptor sequences. On the basis of this analysis the position of the different exons in 
the IGS4ADNA (or IGS4BDNA) sequence can be defined as follows: exoni (1-780), exon 2 (781- 
865). exon 3 (866-991) and exon 4 (992-1658), The AC008571 genomic sequence is of the IGS4A 
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allelic type. 

• 6 overlapping EST entries (accession nrs H11359. R13890, R13353, F07531, F05108, F05107) 
were discovered of which the assembled DNA sequence overlapped at its 3' end with IGS4 exon2 
and the beginning of exon 3. However the DNA sequence upstream of exon 2 was completely 

5 different from IGS4 exon1. Probably these six ESTs are derived from transcripts which originated 

from an alternative promoter. 

• Finally 2 genomic sequence entries (accession nrs AQ019411 and AQ015065) were discovered 
which encoded exon 2. 

Among the many IGS4 cDNA clones that we isolated In the different experiments described above, 
we also discovered a number of clones that contained a 64 bp deletion (pos 866-929 in [GS4ADNA) 
(besides a number of clones derived from unspliced [or partially spliced] transcripts). So far we only 
discovered truncated transcripts of the polymorphic type A. We refer to this splice variant cDNA 
sequence as IGS4A-64DNA (SEQ ID NO: 9 and SEQ ID NO: 11), Since this deletion occurs exactly at 
the exon 2/exon 3 boundary and since the last 2 nucleotides of the deleted fragment are "AG". It is likely 
that this deletion represents an alternative splicing event in which the "AG" within exon 3 served as a 
splice acceptor. The IGS4A reading frame encoded by the splice variant is frameshifted beyond the 
deletion point. The encoded (truncated) protein of 296 amino acids Is referred to as IGS4A-64PROT 
(SEQ ID NO: 10 and SEQ ID NO: 12). Hydropathy analysis of the IGS4A-64PROT sequence shows that 
this protein only contains 5 transmembrane domains (corresponding to TM domains 1-5 of 
IGS4APROT). This truncated receptor might have physiological relevance. 

The bacteria! strain harboring plasmid HNT2322 (containing the IGS4ADNA insert) was 
recloned after replating on LB agar plates containing 100 pg ampicillln/ml and deposited both in the 
25 Innogenetics N.V. strain list (1CCG4320) and at the "Centraalbureau voor Schimmelculturen (CBS)" in 
Baarn. The Netherlands (accession n** CBS1 02221). Plasmid DNA was prepared from the recloned 
isolate and the insert was resequenced and found to be identical to the I6S4ADNA sequence. 

The bacterial strain harboring plasmid HNT2363 (containing the IGS4BDNA Insert) was 
recloned after replating on LB agar plates containing 100 pg amplclllin/ml and deposited both in the 
30 Innogenetics N.V. strain list (fCCG4340) and at the "Centraalbureau voor Schimmelculturen (CBS)" in 
Baarn. The Netherlands (accession n** CBS1 02222). Plasmid DNA was prepared from the recloned 
isolate and the insert was resequenced and found to be identical to the IGS4BDNA sequence. 



wo 01/25269 PCT/EPOO/09584 

-45- 

EXAMPLE 2. SPECIFIC CHANGES IN INTRACELLULAR CALCIUM CONCENTRATIONS INDUCED 
IN CHOG 1 6-IGS4 CELLS BY NEUROMEDIN U. 

Example 2a. Experimental Procedures: Method and Materials. 

5 

A. Method and Materials for IGS-4 transfected CHOG 16-cells. 

The following materials were used in the experiments: Vector containing IGS4-DNA sequence (IGS4- 
pcDNA3.1); SuperFect Transfection Reagent (QIagen); Nut-Mix F12 (Gibco) with 10% PCS, 0.028mg/ml 
Gentamycin (Gibco); 0.22mg/ml Hygromycin (Gibco). 
10 Materials used for clone selection: Nut-Mix F12 with 10% PCS; 0.028mg/ml Gentamycin; 0.22mg/ml 
Hygromycin and 0.55mg/ml Geneticin (Gibco). 

The following method was applied: Transfection with SuperFect Transfection Reagent was carried out 
as described by the manufacturer (Qiagen). Cells were plated in 24-well plates to 50% confluence. Per 
15 well O.Spg/pl plasmid-DNA with Ipl SuperFect Transfection Reagent was added. After 24 hours the 
medium was changed and transfected cell clones were selected by Geneticin-containing selection- 
medium. IGS4 expressing cell clones were characterized by RT-PCR and Northern Blot. 

B. Method and Materials for FLIPR-Assay. 

20 Cell Preparation: 

For cell preparation the following materials were employed: plates : clear, flat-bottom, black well 96-well 
plates (Costar); Media : growth medium: Nut-Mix F-12 (HAM) with Glutamax (Gibco) supplemented with 
10% fetal calf serum (Gibco); Incubator : 5% COj. S/^C (Nuaire). 

25 The method was performed as follows: Cells were seeded 24 hours or 48 hours prior to the experiment 
into black wall microplates. The cell density was 0.8x10-* cells/well for 48 hour incubation and 2.2x10** 
cellsAA^ell for 24 hour incubation. All steps were done under sterile conditions. 

Dve loading : 

30 In order to observe changes in intracellular calcium levels, cells must be ^loaded* with a calcium- 
sensitive fluorescent dye. This dye. called FLUO-4 (Molecular Probes) is excited at 488nm, and emits 
light in the 500-560nm range, only if a complex with calcium is formed. The dye was used at 4pM final 
concentration. Pluronic acid was added to increase dye solubility and dye uptake into the cells, 
Probenicid, an anion exchange protein inhibitor, was added to the dye medium to increase dye retention 

35 in the cells. 

The following materials were used: 

• 2mM dyp stock; Img Fluo-4 (Molecular Probes) solubilized in 443pl low-water DMSO (Sigma). 
Aliquots stored at -20. 
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• 20% pluronic acid solution: 400nng pluronic acid (Sigma) solubilized in 2ml low-water DMSO (Sigma) 
at 37*'C. Stored at room temperature. 

• Dve/pluro nic acid mixture : Immediately before use. equal volumes of the dye stock and 20% 
pluronic acid were mixed. The dye and pluronic acid had a final concentration of 1mM and 10%. 

5 respectively. 

• Probenicid. 250mM stock solution: 710mg probenicid (Sigma) solubilized in 5ml 1N NaOH and 
mixed with 5ml Hank's BSS without phenol red (Gibco) supplemented with 20mM HEPES. 

• Loading-Buffer: 10.5ml Hank's BSS without phenol red (Gibco) supplemented with 20mM HEPES, 
105|j| probenicid, 210mI 1M HEPES. 

10 • Wash-Buffer : Hank's BSS without phenol red (Gibco) supplemented with 20mM HEPES (Gibco) 
and 2.5mM probenicid. 

The method was worked as follows: The 2mM stock of dye was mixed with an equal volume of 20% 
(w/v) pluronic acid immediately before adding to the loading-Buffer. The growth-medium was aspirated 

15 out of the well without disturbing the confluent cell layer. lOOpI loading medium was dispensed into each 
well using a Multidrop (Labsystems). Cell were Incubated in a 5% CO2, ST^'C incubator for 30 minutes. In 
order to calculate the background fluorescence, some wells were not dye loaded. The background 
fluorescence in these wells results from autofluorescence of the cells. After dye loading, cell were 
washed three times with Wash-Buffer (automated Denley cell washer) to reduce the basal fluorescence 

20 to 20.000-25.000 counts above background. 100 I buffer was added and ceil were incubated at 37**C till 
start of the experiment 

C. Preparation of compound plates. 

The peptides were prepared at 3pM (3x the final concentration) for initial screening. For concentration 
25 response curves peptide-solutions were prepared in concentration ranges from 30|jM to lOOnM. All 
peptides were diluted in buffer containing 0.1% BSA (Sigma). 

The following materials were used: Peptides : porcine Neuromedin U25, rat Neuromedin U-23, porcine 
Neuromedin U-8 (Bachem); Dilutionbuffer : Hank's BSS without phenol red (Gibco) supplemented with 
20mM HEPES (Gibco) and 0.1% BSA (Sigma); plates : clear, flat-bottom, 96-well plates (Costar). 

30 

D. Assay. 

The FLIPR setup parameters were set to 0.4 sec exposure length, filter 1, 50|jl fluid addition, pipettor 
height at 125|jl. Dispense Speed 40Ml/sec without mixing. 

35 Example 2b. Results. 

To identify the endogenous ligand for the orphan G protein coupled receptor (GPCR) IGS4, IGS4 (both 
forms IGS4A and 1GS4B) was stably transfected in Chinese Hamster Ovary (CHO) cells. Since the G 
protein coupling mechanism of IGS4 was unknown, a specific CHO-cell strain was used. These CHO- 
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cells stable express the G-protein G 16 (CHOG 16, Molecular Devices), which Is known as "universal 
adapter for GPCRs (Milllgan G.. Marshall F. and Rees S. (1996). Gal 6 as a universal G protein 
adapter: implications for agonist screening strategies. TIPS XL'. 235-237). 

The resulting CHOG 16-IGS4 cells were functionally screened on a Fluorometric Imaging Plate Reader 
5 (FLIPR) to measure mobilisation of Intracellular calcium in response to putative peptide ligands. 

At the concentration of 10nM neuromedin U-23 induced a large, transient and robust calcium-response. 
In contrast, CHOG 16 cells and CHOG 16 cells expressing another, unrelated orphan GPCR. did not 
respond to neuromedin U-23. The results of these experiments with IGS4B are shown in Fig. 4. 

10 Furthermore, the concentration dependence of IGS4 activation by porcine and rat neuromedin U 
Isoforms were investigated (for both forms IGS4A and IGS4B). In the range of lO-^-IO"'^ M porcine 
neuromedin U-25, rat neuromedin U'23, porcine neuromedin U-8 induced specific IGS4-mediated 
calcium mobilisation in the FLIPR assay. All three Neuromedin U isoforms tested caused the same 
maximal activation of [GS4B with LogE.Cso values of -10.09 ± 0.08 (neuromedin U-8. n=4; 80 pM), 

1 5 -10.61 .± 0.08 (neuromedin U-23. n=10; 50 pm) and -9.14 ± 0.09 (neuromedin U-25, n=3; 1 .2 nM). Thus, 
all three peptides cause potent activation of in particular IGS4B. suggesting that neuromedin U is the 
natural agonist for this receptor. The results of these experiments with IGS4B are shown in Fig. 3a 
(neuromedin U-8), Fig. 3b (neuromedin U-23) and Fig. 3c (neuromedin U-25). 

20 For the IGS4A receptor somewhat lower affinities were founds but still showing that the neuromedin U 
peptides are good ligands for IGS4 receptors in general. The log EC50 values found for IGS4A were as 
follows; for neuromedin U-8: log EC50 = -9.3 ± 0.09 (n=1; 485 pM); for neuromedin U-23: log EC50 = 
-7.27 ± 0.16 (n=6; 53 nM); and for neuromedin U-25: log EC5o = -6.18 ± 0.14 (n=3; 658 nM). 

25 The calcium mobilisation response seen following activation of IGS4 by neuromedin U suggests that this 
receptor is coupled to G proteins of the Gq/1 1 subfamily. In addition, basal levels of intracellular cAMP 
were not modulated by porcine neuromedin U-8 (1 and 10|jM) in CHOG 16-IGS4 cells, suggesting that 
this receptor does not couple to G proteins of the Gs subfamilies (data not shown). 

30 

EXAMPLE 3. IGS4 HYBRIDIZATION ON HUMAN MULTIPLE TISSUE EXPRESSION ARRAY 
(MTE™) 

Human IGS4A DNA (± 730 bp BamHI / Hindi!! insert from pGEMT-hlGS4A [ICCG #4320] ) was 
35 radiolabelled via random primed incorporation of [V^^P]-dCTP to a specific activity of > 10® cpm/pg 
using the Prime-It II kit™ (Stratagene). The labeled probe was purified from free label via Sephadex G- 
50 chromatography, denatured for 5 min. at 95**C and added to the ExpressHyb hybridization solution at 
a final concentration of 1-1.5 x 10^ cpm/ml. The human Multiple Tissue Expression (MTE^"^) Array 
(Clontech # 7775-1) was prehybridized and hybridized in ExpressHyb solution at 65*^0 for 30 min and 
40 overnight respectively according to the recommendations of the supplier. 
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The hybridized MTE™ array was washed 5 times for 20 min in 2 x SSC. 1% SDS at 65**C and then 2 
times for 20 min at 55X in 0.1 x SSC, 0.5% SDS. After the washes the array was autoradlographed via 
phosphorimaging (Cyclone Storage Phosphor System, Packard) (Fig.5). Hybridization data of the 
5 MTE™ array were analyzed quantitatively using the OptiQuant Image Analysis Software (Packard). 
Signal intensity of different spot positions containing RNA was corrected for the average background 
signal obtained from empty positions. The signal intensity obtained from the spot containing E. coli DNA 
was considered to represent a sample exhibiting no IGS4 expression. Samples with signal Intensities 
below that of E. coli DNA were considered to be negative. 

10 

Hybridization signals for different tissues on the RNA array have been recalculated by subtracting each 
value with the hybridization signal observed for E. coli DNA (which is considered as the background 
signal). All tissues showing a lower hybridization signal are considered to be below background and to 
15 be IGS4 negative. Expression levels relative to that found in testis (100%) have been plotted and are 
provided in Fig.7. 

EXAMPLE 4. TISSUE DISTRIBUTION OF IGS4 BY NORTHERN BLOT ANALYSIS 

20 

Human IGS4A DNA (± 730 bp BamHI / Hindlll insert from pGEMT-hlGS4A [ICCG #4320] ) was 
radiolabelled via random primed incorporation of [V^^P]-dCTP to a specific activity of > 10® cpm/pg 
using the Prime-It II kit™ (Stratagene). The labeled probe was purified from free label via Sephadex G- 
50 chromatography, denatured for 5 min. at 95**C and added to the ExpressHyb hybridization solution at 
25 a final concentration of 1-1.5 x 10^ cpm/ml. The human Northern blots ( Clontech #7760-1, #7759-1, 
#7767-1. #7755-1 and #7769-1) were prehybridized and hybridized in ExpressHyb solution at SS^'C for 
30 min and overnight respectively according to the recommendations of the supplier. 

After hybridization Northern blots were washed 4 times 10 min at room temperature in 2 x SSC, 0.05 % 
30 SDS and then 2 times 40 min at 50°C in 0.1 x SSC, 0.1% SDS. After the washes the Northern blots 
were autoradiographed using phosphor storage plates (Cyclone Storage Phosphor System, Packard) 
and X-ray films. Results of Northern blots are shown in Fig.6. 

The results of the Northern blot analysis appear to be largely consistent with those from the array 
35 hybridization (Example 3). The strongest signal (2.4 kb transcript) by far is found in testis. A weak 2.4 kb 
band was found in thymus, spinal cord, medulla, thyroid, thalamus, substantia nigra and a very weak 
band in corpus callosum, caudate, nucleus and stomach. For some tissues no 2.4 kb band could be 
seen on Northern whereas a strong to moderate hybridization signal was observed on the MTE array 
(e.g. whole brain, cerebral cortex, lung, temporal and frontal lobe, amygdala, cerebellum, kidney and 
40 hippocampus). 
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EXAMPLE 5. QUANTITATIVE RT-PCR ANALYSIS. 

IGS4 expression levels in different human tissues were also determined via real-time quantitative RT- 
PCR (Q-PCR) using the LightCycler^'^ instrument (Roche Diagnostics) and IGS4 specific TaqMan™ 
probes. 

Example 5a. Experimental procedures. 

A. cDNA synthesis. 

Prior to reverse transcription 3 pg total RNA from the human total RNA panels I to V (Clontech # K4000- 
1 to K4004-1) was treated with 3U DNAse I (Life Technologies # 18068-015) in a 30 pi reaction volume 
(20 mM Tris pH 8.3, 50 mM KCI, 2 mM KCI) for 15 min at room temperature to destroy possibly 
contaminating genomic DNA. The reaction was stopped by adding 3 pi 25 mM EDTA and heating for 10 
min at 65°C. 2,6 pg of the DNAse treated RNA was annealed with 1 ,3 pg oligo(dT) (Life Technologies # 
18418-012) and subjected to reverse transcription using the Omniscript reverse transcriptase (Qiagen 
cat n° 2051 11 ) for 1 h at 37°C In a 52 pi reaction volume according to the protocol recommended by the 
supplier of the enzyme. The Omniscript reverse transcriptase was inactivated by heating at 93^*0 for 5 
min. 

B. Q-PCR. 

Quantitative PGR reactions were carried out in a 20 pi reaction mixture, containing IX TaqMan™ 
Universal PGR Mastermix (PE Applied Biosystems cat #4304437). 0.12 mg BSA/ml. 900 nM of IGS4 
specific forward and reverse primers (IP14.963 and IP14,964), 250 nM of the IGS4 specific TaqMan™ 
probe (IP14.962) and either 1.6 pi (IGS4) or 0.16 pi (GAPDH: glyceraldehyde-3 phosphate 
dehydrogenase) of the cDNA synthesis reaction as template. To set up the IGS4 standard curve a 
dilution series (10^ -lO^copies/ reaction) of IGS4 plasmid ICCG 4320 was used whereas for the GAPDH 
standard curve a dilution series of the human brain cDNA synthesis reaction (0.16 pl, 0.016 and 0.0016 
pi) was used as template. The IX TaqMan^*^ Universal PGR Mastermix contained AmpliTaq Gold™ 
DNA polymerase. AmpErase™ UNG (uracil-N-glycosylase), dNTPs with dUTP, passive reference and 
optimized buffer components. IGS4 specific primers and TaqMan probe were designed using the Primer 
Express™ software (PE Applied Biosystems). Quantitative PGR reactions for human GAPDH were 
carried out under Identical conditions as described for IGS4 except that GAPDH specific primers and 
TaqMan™ probe were used from the TaqMan™ GAPDH control reagents kit (PE Applied Biosystems 
cat n° 402869; sequence information not available from PE, Applied Biosystems). 

PGR reactions were carried out in glass capillary cuvettes in the LightGycler™ instrumenL After 
an initial incubation at 50*C for 2 min to allow the AmpErase™ UNG reaction to proceed and activation 
of the AmpliTaq Gold DNA polymerase (95*C for 10 min), reaction mixtures were subjected to 40 cycles 
of denaturation (15 sec at 95**C) and annealing/extension (1 min at either 60 [GAPDH] or at 68*C 
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[IGS4]), Quantification of experimental samples was carried out using the LightCycler Software version 
3.0. A good linear relationship was obtained between the amount of IGS4 plasmid and the release of 
reporter dye within the range of 10-10^ IGS4 plasmid copies. We also obtained a linear standard curve 
with the GAPDH TaqMan™ probe using the serially diluted brain cDNA. Relative GAPDH expression 
levels were in the range of 0.4 to 10.2 % of that observed in skeletal muscle, which of all tissues tested 
had the highest GAPDH expression level. Relative IGS4 expression levels were expressed as a 
proportion of the level detected in spinal cord, which had the highest IGS4 expression of all tissues 
tested (Fig,8). We also plotted relative IGS4 expression levels after normalization for expression of the 
GAPDH house keeping gene (Fig.8). 

Example 5b. Results. 



Q-PCR using an IGS4 specific TaqMan probe showed that highest expression levels (without 
normalization for GAPDH) were found in spinal cord. IGS4 expression levels in spinal cord amounted to 

15 11,467 copies mRNA / ng pA RNA (assuming 100 % efficiency of the cDNA synthesis reaction and 
assuming that pA RNA constitutes 2% of total RNA). High IGS4 expression levels were also found in 
brain (41 % of spinal cord levels), skeletal muscle (37%), cerebellum (31%). testis (19%) and in lung 
(12%) and heart (11%). Lower levels were found in fetal brain (5%), trachea (4%). prostate (2%) and 
thyroid (1.4%). After normalization for GAPDH expression, the relative IGS4 expression pattern 

20 remained largely unchanged with the exception of skeletal muscle, where the relative expression level 
dropped to 2% of that in spinal cord. As it is not clear whether normalization for GAPDH is a valid 
procedure (GAPDH expression levels can be expected to vary more or less in different cell/ tissue 
types) we prefer to focus on the non-normalized relative expression levels. 

These Q-PGR data seem to be in line with expression data from RNA array (Example 3) and 

25 Northem blot (Example 4) hybridization experiments in the sense that testis, spinal cord and brain 
appear to be among the most prominent expression sites. However Q-PCR analysis additionally shows 
important expression In a number of other tissues, such as skeletal muscle, cerebellum, lung and heart. 

The ligand neuromedin U has been proposed to be a neuropeptide or neuromodulator, without the 
30 knowledge of the specific receptor (Domin J., Ghatei M. A.. Chohan P. and Bloom S. R. (1987), 
Peptides fi: 779-784). Our investigation shows, that IGS4 is a novel member of the neuromedin U- 
receptor family being expressed In CNS and PNS regions, the gastrointestinal. Immunological, 
genitourinary and cardiovascular system, skeletal muscle, thyroid, and lung. 



Table 8: Overview of the PGR oligonucleotide primers and TaqMan probe used In the IGS4 
Q-PCR reactions. 



SEQ ID NO: 32 IP 14.963 5'-GGTCTTCAGCCTGGCGGTCTCTG-3' 
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SEQ ID NO: 33 


IP 14,964 


5'- GGAGGCGAAGCACACGGTCTCA-3' 


SEQ ID NO: 34 


IP 14,962 


5'(FAM)-AGATGTGGCGCAACTACCCTTTCTTGTTCGGGCC- 
(TAMRA)3' 



All publications, including but not limited to patents and patent applications, cited in this specification are 
herein incorporated by reference as if each individual publication were specifically and individually 
indicated to be incorporated by reference herein as though fully set forth. 
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Fig.1 Schematic representation of the relative positions of the different cDNA clones that were 
isolated to generate the consensus IGS4 cDNA sequence. 5' and 3' RACE primers that were 
5 used are also indicated (IGS4R# and IGS4F# respectively) as well as the position of EST 

accession n*' N45474. Primer IGS4R6 was located within intron 1. Some clones (e.g. HNT2311, 
HNT2312 and HNT2253) were only partially sequenced (only the part that was sequenced is 
indicated). CONSENSUS A and CONSENSUS B denote the consensus sequence of IGS4 
allelic types A and B respectively. The nucleotide that was identified at each of the 4 
10 polymorphic positions Is indicated (shaded boxes) for each clone. "S" indicates a sequence 

ambiguity in clones HNT2211 and HNT2212 and means either "C" or "T\ The coding area of 
IGS4A and IGS4B consensus sequences is indicated with "**". As there were some remaining 
sequence ambiguities in the 5* end of the consensus sequence, the IGS4ADNA and IGS4BDNA 
sequences have only been taken from position 86 until the end 

15 

Fig.2 Schematic representation of the relative positions of different DNA database entries compared 
to the IGS4 cDNA sequence. The IGS4 cDNA sequence is indicated with the boxes (the 
position of the IGS4 coding sequences is indicated with the filled boxes). The relative position of 
iGS4 exons 1-4 is indicated above the IGS4 cDNA sequence ("=="). The parts of the genomic 

20 sequence AC008571 that encode exons 1->4 are indicated with AC008571a->d respectively. 

The position of these fragments within the AC008571 sequence are: AC008571a (13129-13908 
of the reverse complement of AC008571). AC008571b (51676-51760 of AC008571), 
AC008571C (79978-80103 of the reverse complement of AC008571) and AC008571d (83060- 
83728 of the reverse complement of AC008571). G05725 and G20615 are STS (sequence 

25 tagged sequence) entries whereas F05107, F05108, F07531, R13353, R13890, H11359, 

N45474. W61169. AI432384, W61131. AI023570. F01358, F03770. 238158. R40869. R37725, 
H1 1333 are EST entries. The parts of genomic clones AQ019411 and AQ015065 that contain 
IGS4 exon 2 are indicated with The 5* part of EST sequences F05107, F05108. F07531. 
R13353, R13890 and H11359 which is totally different from the IGS4 cDNA sequence is 

30 indicated with "*". AQ078563 is a genomic clone. 

Fig.3 : IGS4 receptor activation by different Neuromedin U isoforms. CHOG 16-IGS4B cells were 
cultured in 96-well plates overnight and loaded with Fluo-4AM. The receptor mediated Ca^* 
changes were measured with FLIPR (Molecular Devices). Maxima of the fluorescence change 
35 detected by the CCD camera were normalised to 1 and are depicted as counts. 

Fig. 3a: results for neuromedin U-8; 
Fig. 3b: results for neuromedin U-23; 
Fig. 3c: results for neuromedin U-25. 
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Fig.4 Neuromedin U-23 induced intracellular Ca^* mobilisation in CHOG 16-cells expressing IGS4B. 
Application of 10nM Neuromedin U-23 to the cell lines CHOG 16-IGS4, CHOG 16 and CHOG 
16 transfected with an other orphan GPCR. Cells were cultured in 96-weII plates overnight and 
loaded with FIuo-4AM. Receptor mediated intracellular Ca^^ changes were measured with 
FLIPR (Molecular Devices), depicted in counts detected by the CCD camera. 

Fig. 5 Human multiple tissue expression array using a human IGS4 probe. 

Fig. 6 Northern blot analysis using an IGS4 probe. 

Fig. 7 IGS4 expression analysis (MTE blot). 

Fig. 8 Relative expression levels of IGS4 mRNA as compared to the expression observed In spinal 
cord. Both non-normalized and GAPDH-normalized expression levels are shown. 
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Claims 

1 . An isolated polynucleotide comprisfng a nucleotide sequence selected from the group 
consisting of: 

a) a nucleotide sequence encoding the IGS4 polypeptide according to SEQ ID 
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8; 

b) a nucleotide sequence encoding the polypeptide encoded by the DNA 
insert contained in the deposit no. CBS1 02221 or the deposit no. 
CBS1 02222 at the Cenlraalbureau voor Schimmelcultures at Baarn the 
Netherlands, in particular a nucleotide sequence comprising SEQ ID NO: 1 , 
SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7; 

c) a nucleotide sequence having at least 80 % (preferably at least 90%) 
sequence identity over Its entire length to the nucleotide sequence of (a) or 
(b); 

d) a nucleotide sequence which Is complimentary to the nucleotide sequence 
of (a) or (b) or (c). 

2. The polynucleotide of claim 1 wherein said polynucleotide comprises the nucleotide 
sequence contained in SEQ ID NO: 1 encoding the IGS4 polypeptide of SEQ ID 
NO: 2 or the nucleotide sequence contained in SEQ ID NO: 3 encoding the IGS4 
polypeptide of SEQ ID NO: 4 or the nucleotide sequence contained in SEQ ID NO: 
5 encoding the IGS4 polypeptide of SEQ ID NO: 6 or the nucleotide sequence 
contained in SEQ ID NO: 7 encoding the IGS4 polypeptide of SEQ ID NO: 8. 

3. The polynucleotide of claim 1 wherein said polynucleotide comprises a nucleotide 
sequence that is at least 80% identical to that of SEQ ID NO: 1, SEQ ID NO: 3, 
SEQ ID NO: 5 or SEQ ID NO: 7 or to the sequence of the DNA insert contained In 
the deposit no. CBS102221 or the deposit no. CBS102222 at the Centraalbureau 
voor Schimmelcultures at Baam the Netherlands over its entire length. 

4. The polynucleotide of claim 3 which is the polynucleotide of SEQ ID NO: 1, SEQ ID 
NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 or the DNA insert contained in the deposit 
no. CBS1 02221 or the deposit no. CBS1 02222 at the Centraalbureau voor 
Schimmelcultures at Baam the Netherlands. 

5. The polynucleotide of claim 1-4 which is DNA or RNA. 
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6. An isolated nucleotide sequence encoding an IGS4 neuromedin receptor protein 
comprising the amino acid sequence set forth in SEQ ID NO:2, SEQ ID NO:4. SEQ 
ID NO:6 or SEQ ID NO:8, preferably a mammalian neuromedin receptor protein, 
said protein exhibiting high affinity binding for neuromedin U, preferably for 
neuromedin U-8, for neuromedin U-23 and/or for neuromedin U-25. 

7. An isolated nucleotide sequence of claim 6 encoding an iGS4 neuromedin receptor 
protein comprising the amino acid sequence set forth in SEQ ID NO:2, SEQ ID 
NO:4, SEQ ID NO:6 or SEQ ID NO:8, said protein exhibiting expression in brain, 
skeletal muscle, cerebellum, testis, corpus callosum. spinal cord, substantia nigra, 
medulla, thalamus, caudate nucleus, pons, nucleus accumbens, fetal brain, 
stomach, heart, thyroid gland, lung, thymus, prostate and/or In trachea. 

8. An isolated nucleotide sequence encoding an IGS4 neuromedin receptor protein 
comprising the amino acid sequence set forth in SEQ ID NO:2. SEQ ID NO:4, SEQ 
ID NO:6 or SEQ ID NO:8. preferably a mammalian neuromedin receptor protein, 
said protein exhibiting high affinity binding for neuromedin U, preferably for 
neuromedin U-8. for neuromedin U-23 and/or for neuromedin U-25, said protein 
exhibiting expression In brain, skeletal muscle, cerebellum, testis, corpus callosum. 
spinal cord, substantia nigra, medulla, thalamus, caudate nucleus, pons, nucleus 
accumbens, fetal brain, stomach, heart, thyroid gland, lung, thymus, prostate and/or 
in trachea, and said nucleotide sequence being selected from the group of 
nucleotide sequences as defined in the claims 1 to 5. 

9. A DNA or RNA molecule comprising an expression vector, wherein said expression 
vector is capable of producing an IGS4 polypeptide comprising an amino acid 
sequence, which has at least 80% identity with the polypeptide of SEQ ID NO: 2. 
SEQ ID NO: 4. SEQ ID NO: 6 or SEQ ID NO: 8 or with the polypeptide encoded by 
the DNA insert contained in the deposit no. CBS1 02221 or the deposit no. 
CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn the 
Netherlands, when said expression vector is present in a compatible host cell. 



10. An isolated DNA or RNA molecule comprising an expression vector, wherein said 
expression vector is capable of producing an IGS4 neuromedin receptor protein 
comprising the amino acid sequence set forth in SEQ ID NO:2. SEQ ID NO:4, SEQ 
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ID NO:6 or SEQ ID NO:8, preferably a mammalian neuromedin receptor protein, 
said protein exhibiting high affinity binding for neuromedin U. preferably for 
neuromedin U-8, for neuromedin U-23 and/or for neuromedin U-25, and exhibiting 
expression In brain, skeletal muscle, cerebellum, testis, corpus catlosum, spinal 
cord, substantia nigra, medulla, thalamus, caudate nucleus, pons, nucleus 
accumbens, fetal brain, stomach, heart, thyroid gland, lung, thymus, prostate and/or 
in trachea. 

11. A host cell comprising the expression system of claim 9 or 1 0. 

12. A host cell according to claim 1 1 which is a yeast cell. 

13. A host cell according to claim 1 1 which is an animal cell. 

14. IGS4 receptor membrane preparation derived from a cell according to claim 11-13. 

15. A process for producing an IGS4 polypeptide comprising culturing a host of claim 
11-13 under conditions sufficient for the production of said polypeptide and 
recovering the polypeptide from the culture. 

16. A process for producing a cell which produces an 1GS4 polypeptide comprising 
transforming or transfecting a host cell with the expression system of claim 9 or 10 
such that the host cell, under appropriate culture conditions, produces an IGS4 
polypeptide. 

17. An IGS4 polypeptide comprising an amino acid sequence which is at least 80% 
identical to the amino add sequence of SEQ ID NO: 2. SEQ NO: 4. SEQ NO: 6 or 
SEQ NO: 8 or to the polypeptide encoded by the DNA Insert contained in the 
deposit no. CBS1 02221 or the deposit no. CBS1 02222 at the Centraalbureau voor 
Schimmelcultures at Baam the Netheriands over its entire length. 

18. The polypeptide of claim 17 which comprises the amino acid sequence of SEQ ID 
NO: 2, SEQ ID NO: 4, SEQ NO: 6 or SEQ NO: 8 or the amino acid sequence 
encoded by the DNA Insert contained in the deposit no. CBS1 02221 or the deposit 
no. CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baarn the 
Netherlands. 
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19. An isolated IGS4 neuromedin receptor protein comprising the amino acid sequence 
set forth in SEQ ID NO:2. SEQ ID NO:4. SEQ ID NO:6 or SEQ ID NO:8, preferably 
of a mammalian neuromedin receptor protein, said protein exhibiting high affinity 
binding for neuromedin U, preferably for neuromedin U-8, for neuromedin U-23 
and/or for neuromedin U-25. 

20. An Isolated IGS4 neuromedin receptor protein of claim 19 comprising the amino 
acid sequence set forth in SEQ ID NO:2. SEQ ID NO:4. SEQ ID NO:6 or SEQ ID 
NO:8, said protein exhibiting expression in brain, skeletal muscle, cerebellum, 
testis, corpus callosum, spinal cord, substantia nigra, medulla, thalamus, caudate 
nucleus, pons, nucleus accumbens, fetal brain, stomach, heart, thyroid gland, lung, 
thymus, prostate and/or in trachea. 

21. An isolated IGS4 neuromedin receptor protein comprising the amino acid sequence 
set forth in SEQ ID NO:2, SEQ ID.NO:4, SEQ ID NO:6 or SEQ ID NO:8, preferably 
a mammalian neuromedin receptor protein, said protein exhibiting high affinity 
binding for neuromedin U. preferably for neuromedin U-8, for neuromedin U-23 
and/or for neuromedin U-25, said protein exhibiting expression in brain, skeletal 
muscle, cerebellum, testis, corpus callosum, spinal cord, substantia nigra, medulla, 
thalamus, caudate nucleus, pons, nucleus accumbens. fetal brain, stomach, heart, 
thyroid gland, lung, thymus, prostate and/or in trachea, and said amino acid 
sequence being selected from the group of amino acid sequence as defined in the 
claims 17-18. 

22. An antibody immunospecific for the IGS4 polypeptide comprising the amino acid 
sequence set forth in SEQ ID NO:2, SEQ ID NO:4. SEQ ID NO:6 or SEQ ID NO:8 
of claim 17-21. 



23. Use of: 

(a) a therapeutically effective amount of an agonist to the IGS4 neuromedin 
receptor polypeptide comprising the amino acid sequence set forth in SEQ ID NO;2, 
SEQ ID NO:4, SEQ ID NO:6 or SEQ ID NO:8 of claim 17-21; and/or 

(b) an isolated polynucleotide comprising a nucleotide sequence that has at 
least 80% identity to a nucleotide sequence encoding the IGS4 polypeptide of SEQ 
ID NO: 2. SEQ ID NO: 4. SEQ NO: 6 or SEQ NO: 8 or the polypeptide encoded by 
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the DNA insert contained in the deposit no. CBS1 02221 or the deposit no. 
CBS1 02222 at the Centraalbureau voor Schimmelcultures at Baam the Netherlands 
over its entire length; or a nucleotide sequence complementary to one of said 
nucleotide sequences in a form so as to effect production of said receptor activity in 
vivo; and/or 

(c) an isolated polynucleotide comprising a nucleotide sequence that encodes 
an IGS4 neuromedin receptor protein comprising the amino acid sequence set forth 
in SEQ ID NO:2, SEQ ID NO:4. SEQ ID NO:6 or SEQ ID NO:8, preferably a 
mammalian IGS4 neuromedin receptor protein, said protein exhibiting high affinity 
binding for neuromedin U, preferably for neuromedin U-8, for neuromedin U-23 
and/or for neuromedin U-25, 

for the preparation of a medicament for the treatment of a subject suffering from a 
disease related to expression or activity of the IGS4 neuromedin receptor 
polypeptide, in need of enhanced activity or expression of IGS4 polypeptide 
comprising the amino acid sequence set forth in SEQ ID NO:2. SEQ ID NO:4. SEQ 
ID NO:6 or SEQ ID NO:8 of claim 17-21 . 



24. Use of: 

(a) a therapeutically effective amount of an antagonist to the iGS4 neuromedin 
receptor polypeptide comprising the amino acid sequence set forth in SEQ ID NO:2, 
SEQ ID NO:4. SEQ ID NO:6 or SEQ ID NO:8 of claim 17-21; and/or 

(b) a nucleic acid molecule that inhibits the expression of the nucleotide 
sequence encoding the IGS4 neuromedin receptor polypeptide comprising the 
amino acid sequence set forth in SEQ ID NO:2. SEQ ID NO:4, SEQ ID NO:6 or 
SEQ ID NO:8 of claim 17-21; and/or 

(c) a therapeutically effective amount of a polypeptide that competes with the 
IGS4 neuromedin receptor polypeptide comprising the amino acid sequence set 
forth in SEQ ID NO:2. SEQ ID NO:4. SEQ ID NO:6 or SEQ ID NO:8 of claim 17-21 
for its ligand, 

for the preparation of a medicament for the treatment of a subject suffering from a 
disease related to expression or activity of the IGS4 neuromedin receptor 
polypeptide, having need to Inhibit activity or expression of IGS4 polypeptide 
comprising the amino acid sequence set forth in SEQ ID NO:2. SEQ ID NO:4, SEQ 
ID NO:6 or SEQ ID NO:8 of claim 17-21. 
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25. A process for diagnosing a disease or a susceptibility to a disease in a subject 
related to expression or activity of the IGS4 neuromedin receptor polypeptide 
comprising the amino acid sequence set forth in SEQ ID NO:2. SEQ ID NO:4. SEQ 
ID NO:6 or SEQ ID NO:8 of claim 17*21 in a subject, comprising: 

(a) determining the presence or absence of a mutation in the nucleotide 
sequence encoding said iGS4 polypeptide in the genome of said subject in a 
sample derived from said subject; and/or 

(b) analyzing for the presence or amount of said IGS4 polypeptide expression 
in a sample derived from said subject. 

26. A method for identifying agonists to the IGS4 polypeptide comprising the amino acid 
sequence set forth in SEQ ID NO:2, SEQ ID NO:4. SEQ ID NO:6 or SEQ ID NO:8 
of claim 17-21, comprising: 

(a) contacting a cell which produces said IGS4 polypeptide with a test 
compound; and 

(b) determining whether the test compound effects a signal generated by 
activation of the IGS4 polypeptide. 

27. An agonist identified by the method of claim 26. 

28. A method for identifying agonists to the iGS4 neuromedin receptor protein 
comprising the amino acid sequence set forth in SEQ ID NO:2, SEQ ID NO:4. SEQ 
ID NO:6 or SEQ ID NO:8, preferably to the mammalian IGS4 neuromedin receptor 
protein, said protein exhibiting high affinity binding for neuromedin U, preferably for 
neuromedin U-8, for neuromedin U-23 and/or for neuromedin U-25, comprising: 

(a) contacting a cell which produces said IGS4 neuromedin receptor protein 
with a test compound; and 

(b) determining whether the test compound effects a signal generated by 
activation of the IGS4 neuromedin receptor protein. 

29. A method for identifying agonists to the IGS4 neuromedin receptor protein 
comprising the amino acid sequence set forth in SEQ ID N0:2, SEQ ID NO:4, SEQ 
ID NO:6 or SEQ ID NO:8 according to claim 28, wherein said agonists are effective 
with regard to disorders of the riervous system, including the central nervous 
system (CNS) and the peripheral nervous system (PNS), disorders of the 
gastrointestinal system and/or of the cardiovascular system and/or of skeletal 
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muscle and/or of the thyroid, and/or also to lung diseases, immunological diseases 
and disorders of the genitourinary system. 

30. An agonist identified by the method of claim 28 or 29, preferably an agonist being 
effective with regard to disorders of the nervous system, including the central 
nervous system (CNS) and the peripheral nervous system (PNS), disorders of the 
gastrointestinal system and/or of the cardiovascular system and/or of skeletal 
muscle and/or of the thyroid, and/or also to lung diseases, immunological diseases 
and disorders of the genitourinary system. 

31. A method for identifying antagonists to the IGS4 polypeptide comprising the amino 
acid sequence set forth in SEQ ID NO:2. SEQ ID NO:4, SEQ ID NO:6 or SEQ ID 
NO;8 of claim 17-21 comprising: 

(a) contacting a cell which produces said 1GS4 polypeptide with an agonist; and 

(b) determining whether the signal generated by said agonist is diminished in 
the presence of a candidate compound. 

32. An antagonist identified by the method of claim 31 . 

33. A method for identifying antagonists to the IGS4 neuromedin receptor protein 
comprising the amino acid sequence set forth in SEQ ID NO:2. SEQ ID NO:4. SEQ 
ID NO:6 or SEQ ID NO:8, preferably to the mammalian IGS4 neuromedin receptor 
protein, said protein exhibiting high affinity binding for neuromedin U, preferably for 
neuromedin U-8, for neuromedin U-23 and/or for neuromedin U-25, comprising: 

(a) contacting a cell which produces said IGS4 neuromedin receptor protein 
with an agonist; and 

(b) determining whether the signal generated by said agonist is diminished in 
the presence of a candidate compound. 

34. A method for identifying antagonists to the IGS4 neuromedin receptor protein 
comprising the amino acid sequence set forth in SEQ ID NO:2, SEQ ID NO:4, SEQ 
ID NO:6 or SEQ ID NO:8 according to claim 33, wherein said antagonists are 
effective with regard to disorders of the nervous system, including the central 
nervous system (CNS) and the peripheral nervous system (PNS), disorders of the 
gastrointestinal system and/or of the cardiovascular system and/or of skeletal 



AMENDED SHEET 



21-09-2001 SPW99.06 



»t - 11 .1"--*. f»'^*H *"~ft tl St SI ti » ! -"-f ""-J ■ift'^-- -""'ll 



EP0009584 



8 



muscle and/or of the thyroid, and/or also to lung diseases, immunological diseases 
and disorders of the genitourinary system. 

35. An antagonist Identified by the method of claim 33 or 34, preferably an antagonist 
being effective with regard to disorders of the nervous system, including the central 
nervous system (CNS) and the peripheral nervous system (PNS), disorders of the 
gastrointestinal system and/or of the cardiovascular system and/or of skeletal 
muscle and/or of the thyroid, and/or also to lung diseases, immunological diseases 
and disorders of the genitourinary system. 

36. A recombinant host cell produced by a method of claim 16 or a membrane thereof 
expressing an IGS4 polypeptide. 

37. A method of creating a genetically modified non-human animal for 1GS4 
neuromedin receptor related disorders comprising the steps of: 

(a) ligating the coding portion of a nucleic acid molecule, consisting 
essentially of a nucleic acid sequence encoding a protein having the amino acid 
sequence SEQ ID NO: 2, SEQ ID NO: 4, SEQ NO: 6 or SEQ NO: 8 or the amino 
acid sequence encoded by the DNA insert contained in the deposit no. CBS1 02221 
or the deposit no. CBS1 02222 at the Centraalbureau voor Schimmelcultures at 
Baam the Netherlands, to a regulatory sequence which is capable of driving high 
level gene expression or expression in a cell type in which the gene is not normally 
expressed in said animal; or 

(b) isolation and engineering the coding portion of a nucleic acid molecule, 
consisting essentially of a nucleic acid sequence encoding a protein having the 
amino acid sequence SEQ ID NO: 2. SEQ ID NO: 4. SEQ NO: 6 or SEQ NO: 8 or 
the amino acid sequence encoded by the DNA insert contained in the deposit no. 
CBS1 02221 or the deposit no. CBS1 02222 at the Centraalbureau voor 
Schimmelcultures at Baarn the Netherlands, and reintroducing said sequence in the 
genome of said animal in such a way that the endogenous gene alleles, encoding a 
protein having the amino acid sequence SEQ ID NO: 2. SEQ ID NO; 4, SEQ NO: 6 
or SEQ NO: 8 or the amino acid sequence encoded by the DNA insert contained in 
the deposit no. CBS1 02221 or the deposit no. CBS1 02222 at the Centraalbureau 
voor Schimmelcultures at Baann the Netherlands, are fully or partially inactivated. 
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38. A method of determining whether a substance is a potential ligand of IGS4 receptor 
comprising the amino acid sequence set forth in SEQ ID NO:2. SEQ ID NO:4. SEQ 
ID NO:6 or SEQ ID NO:8. comprising: 

(a) contacting cells expressing the receptor of one of the claims 1 7-21 or one 
of SEQ ID NO:2. SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8, or contacting a 
receptor membrane preparation comprising one of said receptors of one of the 
claims 17-21 or one of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6 and SEQ ID 
NO:8 with labeled neuromedin U in the presence and in the absence of the 
substance; and 

(b) measuring the binding of neuromedin U to said IGS4. 

39. A polypeptide according to any of the claims 17-21. further being characterized in 
that said polypeptide binds neuromedin U, preferably neuromedin U-8, neuromedin 
U-23 and/or neuromedin U-25, showing at least an affinity of about log EC5o=-6. 

40. A polypeptide according to any of the claims 17-21 . further being characterized in 
that said polypeptide binds neuromedin U, preferably neuromedin U-8, neuromedin 
U-23 and/or neuromedin U-25, showing at least an affinity of about log EC5o=-9. 
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(54) Title: NOVEL HUMAN G-PROTEIN COUPLED RECEPTOR 

(57) Abstract: The present invention relates to novel identified polynucleotides, polypeptides encoded by them and to the use of such 
polynucleotides and polypeptides, and to their production. More particularly, the polynucleotides and polypeptides of the present 
invention relate to the G-protein coupled receptor family, referred to as IGS4-family. The invention also relates to inhibiting or acti- 
vating the action of such polynucleotides and polypeptides, to a vector containing said polynucleotides, a host cell containing such 
vector and transgenic animals where theIGS4-gene is either overexpressed, misexpressed, underexpressed or suppressed (knock-out 
animals). The invention further relates to a method for screening compounds capable to act as an agonist or an antagonist of said 
G-protein coupled receptor family IGS4 and the use of IGS4 polypeptides and polynucleotides and agonists or antagonists to the 
IGS4 receptor family in the treatment of PNS, psychiatric and CNS disorders, including schizophrenia, episodic paroxysmal anxi- 
ety (EPA) disorders such as obsessive compulsive disorder (OCD), post traumatic stress disorder (PTSD), phobia and panic, major 
depressive disorder, bipolar disorder, Parkinson's disease, general anxiety disorder, autism, delirium, multiple sclerosis, Alzheimer 
disease/dementia and other neurodegenerative diseases, severe mental retardation, dyskinesias, Huntington* s disease, Tourett^s syn- 
drome, tics, tremor, dystonia, spasms, anorexia, bulimia, stroke, addiction/dependency/craving, sleep disorder, epilepsy, migraine; 
attention deficit/hyperactivity disorder (ADHD); cardiovascular diseases, including heart failure, angina pectoris, arrhythmias, my- 
ocardial infarction, cardiac hypertrophy, hypotension, hypertension - e.g. essential hypertension, renal hypertension, or pulmonary 
hypertension, thrombosis, arteriosclerosis, cerebral vasospasm, subarachnoid hemorrhage, cerebral ischemia, cerebral infarction, 
peripheral vascular disease, Raynaud's disease, kidney disease - e.g. renal failure; dyslipidemias; obesity; emesis; gastrointestinal 
disorders, including irritable bowel syndrome (IBS), inflammatory bowel disease (IBD), gastroesophagal reflux disease (GERD), 
motility disorders and conditions of delayed gastric emptying, such as post operative or diabetic gastroparesis, and diabetes, ulc^s 
- e.g. gastric ulcer; diarrhoea; other diseases including osteoporosis; inflanunations; infections such as bacterial, fungal, protozoan 
and viral infections, particularly infections caused by HIV-l or HlV-2; pain; cancers; chemotherapy induced injury; tumor invasion; 
immune disorders; urinary retention; asthma; alleigies; arthritis; benign prostatic hypertrophy; endotoxin shock; sepsis; complica- 
tion of diabetes mellitus; and gynaecological disorders, among others and diagnostic assays for such conditions. Preferred uses of the 
invention relate to disorders of the nervous system, including the central nervous system (CNS) and the peripheral nervous system 
(PNS), disorders of the gastrointestinal system and/or of the cardiovascular system and/or of skeletal muscle and/or of the thyroid, 
and/or also to lung diseases, immunological diseases and disorders of the genitourinary system. The invention also relates to the 
identification of the cognate ligand of the IGS4 polypeptides of the invention. High affinity binding to said IGS4 polypeptides is 
found for the neuropeptides known as neuromedin U. 
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Worldwide Rights 

DECLARATION AND POWER OF ATTORNEY 

As a below nanned inventor, lAA/e hereby declare that: my residence, post office 
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and sole inventor (if only one name is listed below) or an original, first, and joint inventor (if 
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is sought on the invention entitled: Human G-protein coupled receptor 

the specification of which is attached and/or im was filed on 

as United States Application Serial No. or 

PCT International Application No. PCT/E POO/09584 and was amended on 14 Sept embe r 2001 

i/We hereby state that lAA/e have reviewed and understand the contents of the 
above-identified specification, including the claims, as amended by any amendment referred 
to above. lA/Ve acknowledge the duty to disclose information which is material to 
patentability as defined in 37 CFR § 1 .56. 

I/We hereby claim foreign priority benefits under 35 U.S.C. § 1 19(a)-(d) or § 365(b) of 
any foreign application(s) for patent or inventor's certificate or § 365(a) of any PCT 
international application(s) designating at least one country other than the United States, 
listed below and have also identified below, any foreign application(s) for patent or inventor's 
certificate, or any POT International application(s) having a filing date before that of the 
application(s) of which priority is claimed: 



Country 


Application 

Number 


Date of Filing 


Priority Claimed Under 
35 U.S.C. 119 


EP (NL designated) 


99203140.1 


24 September 1999 


X YES 
NO 


NL 


1013140 


24 September 1999 


X YES 
NO 


EP (NL designated) 


00202683.9 


28 July 2000 


X YES 
NO 



I/We hereby claim the benefit under 35 U.S.C. § 1 19(e) of any United States 
provisional application(s) listed below: 



Application Number 


Date of Filing 


66/222,047 


31 July 2000 
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Finnegan, Henderson, Farabow, Garrett & Dunner, L.L.P. • Washington, D.C. 

Attorney Docket No.: SPW99.06 US 
Declaration/Power of Attorney 
Worldwide Rights 

DECLARATION AND POWER OF ATTORNEY 

As a below named inventor, I/We hereby declare that: my residence, post office 
address and citizenship are as stated below next to my name; I believe I am the original, first, 
and sole inventor (if only one name is listed below) or an original, first, and joint inventor' (if 
plural names are listed below) of the subject matter which is claimed and for which a patent 

is sought on the invention entitled: Human G-protein coupled receptor 

the specification of which i® is attached and/or i^ was filed on 

as United States Application Serial No. or 

PCT International Application No. PCT/EP00/Q9584 and was amended on s ppt o mbor- 2i 

I/We hereby state that I/We have reviewed and understand the contents of the 
above-identified specification, including the claims, as amended by any amendment referred 
to above. I/We acknowledge the duty to disclose information which is material to 
patentability as defined in 37 CFR § 1.56. 

I/We hereby claim foreign priority benefits under 35 U.S.C. § 1 19(a)-(d) or § 365(b) of 
any foreign application(s) for patent or inventor's certificate or § 365(a) of any PCT 
international application(s) designating at least one country other than the United States, 
listed below and have also identified below, any foreign application(s) for patent or inventor's 
certificate, or any PCT International application(s) having a filing date before that of the 
application(s) of which priority is claimed: 



Country 


Application 
Number 


Date of Filing 


Priority Claimed Under 
35 U.S.C. 119 


EP (NL designated) 


99203140.1 


24 September 1999 


X YES 
NO 


NL 


1013140 


24 September 1999 


X YES 
NO 


EP (NL designated) 


00202683.9 


28 July 2000 


X YES 
NO 



I/We hereby claim the benefit under 35 U.S.C. § 1 19(e) of any United States 
provisional application(s) listed below: 



Application Number 


Date of Filing 


66/222,047 


31 July 2000 








Finnegan, Henderson, Farabow, Garrett & Dunner, L.L.P. • Washington, D.C. 



I/We hereby claim the benefit under 35 U.S.C. § 120 of any United States 
application(s) or § 365(c) of any PCT International applicatlon(s) designating the United 
States, listed below and, insofar as the subject matter of each of the claims of this application 
is not disclosed in the prior United States or PCT International application(s) in the manner 
provided by the first paragraph of 35 U.S.C. § 1 12, I acknowledge the duty to disclose 
information which is material to patentability as defined in 37 CFR § 1 .56 which became 
available between the filing date of the prior application(s) and the national or PCT 
International filing date of this application: 



Application Number 


Date of Filing 


Status (Patented, 
Pending, Abandoned) 















l/We hereby appoint the following attorney and/or agent(s) to prosecute this 
application and transact ail business in the Patent and Trademark Office connected 
therewith. FINNEGAN, HENDERSON, FARABOW, GARRETT & DUNNER, L.L.P., Douglas 

B. Henderson, Reg. No. 2.0.,291; Ford F. Farabow, Jr., Reg. No. 20.630; Arthur S. Garrett, 
Reg. No. 2CL338^Donald R. Dunner, Reg. No. 1 9.073; Bri an G. Bmnsvold, Reg. No. 22. 593; 
Tipton D. Jennings, IV, Reg. No. 20,645; Je rry D. Voight, Reg. No. 23.020 ; Laurence R. 
Hefter, Reg. No. 2(l,a2ZLKenneth E. Payne, Reg. No. 23 .098: H erbert H. Mintz, Reg. No. 
^6M1^C. Larry O'Rourke, Reg. No. 26,014; Albert J. Santorelli, Reg. No. 2 2,610;. M ichael 

C. Elmer, Reg. No. 2.5,S5Z;_Richard H. STrfiTFTTReg. No. 20.609; _Stephen L. Peterson, Reg. 
No. 2&,325;-John M. Romary, Reg. No. 2 6,331; Br uce C. Zotter, Reg. No. 27,680;J3ennis P. 
O'Reilley, Reg. No. 27 ^9 3 2 ; Allen M. Sokal, Reg. No. 26,695; R obert D. Bajefsky, Reg. No. 
25,3aZ; Richard L. Stroup, Reg. No. 28.478; David W. Hill, Reg. No. 2 8,220; Thomas L. 
Irving, Reg. No. 28 ._619: C harles E. Lipsey, Reg. No. 2 8.165; Thomas W. Winland, Reg. No. 
2Z305^ Basil J. Lewris, Reg. No. 28.818; Martin I. Fuchs. Reg. No. 28. 508: E . Robert 
Yoches. Reg. No. 30.170: Barry W. Graham, Reg. No. 29,924; Susan Haberman Griffen, 
Reg. No. SOSDZuRichard B. Racine, Reg. No. 30,415; T homas H. Jenkins. Reg. No. 3 0.857; 
Robert E. Converse, Jr., Reg. No. 27.432i,Clair X. Mullen, Jr., Reg. No. 2 0,348; C hristopher 
P. Foley, Reg. No. 31Ji54rJ©hn C. Paul, Reg. No. 30,413; Roger D. Taylor, Reg. No. 
28,992; David M. Kelly, Reg. No. 3 0.953: Kenneth J. Meyers. Reg. No. 25,146; C arol P. 
Einaudi, Reg. No. 32.220 :J /Valter Y. Boyd, Jr.. Reg. No. 3U3a; Steven M. Anzalone, Reg. 
No. 32Jm; Jean B. Fordis, Reg. No. 32.384; Barbara C. McCurdy, Reg. No. 3 2.120; J ames 
K. Hammond. Reg. No. 3jL2fi4i-Richard V. Burgujian. Reg. No. 3 1,744; J. Michael Jakes, 
Reg. No. 32^^8242 Dirk D. Thomas, Reg. No. 32.60 0; Thomas W. Banks, Reg. No. 32,719; 
Christopher P. Isaac, Reg. No. 32,6jl6; Bryan C. Diner, Reg. No. 32,409; M. Paul Barker, 
Reg. No. 32,013; Andrew Chanho Sonu, Reg. No.^,457^David S. Forman, Reg. No. 
3^j694; Vincent P. Kovalick, Reg. No. 32367Hames W. Edmondson, Reg. No. 3 3,871; 
Michael R. McGurk, Reg. No. 3 2.045 LJoann M. Neth, Reg. No. 3 6.363: G erson S. Panitch, 
Reg. No. 3^51; Cheri M. Taylor, Reg. No. 33.,2J£; Chartes E. Van Horn, Reg. No. 40,266; 
Linda A. WadlerTlReg. No. 33,218iJeffrey A. Berkowitz, Reg. No. 36,243^Michael R. Kelly. 
Reg. No. ^921 ; a nd James B. Monroe, Reg. No. 3 3.971 ; and . 
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L.L.P. 1 300 I Street. N.W.. Washington. D.C. 20005. Telephone N o. (202) 408-4000. 
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Finnegan, Henderson. Farabow, Garrett & Dunner, L.L.P. • Washington, D.C. 



I/We hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be true; and further 
that these statements were made with the knowledge that willful false statements and the like 
so made are punishable by fine or imprisonment, or both, under section 1 001 of Title 1 8 of 
the United States Code, and that such willful false statements may jeopardize the validity of 
the application or any patent issuing thereon. 

IN TESTIMONY WHEREOF, I/We have hereunto set our hands. 



Full Name of First Inventor: 
Willy Deleersniider 


Inventor's Signature: 


Date: 


Residence: Weesp, The Netherlanc 




Citizenship: 
Belgian 


Post Office Address: 

C.J. van Houtenlaan 36. 1381 CP Weesp, The Netherlands 


Full Name of Second Inventor: 
Guy Nys 


Inventor's Signature: 


Date: 


Residence: Weesp, The Netherlands k 


Citizenship: 
Belgian 


Post Office Address: 

C.J. van Houtenlaan 36, 1381 CP Weesp, The Netherlands 
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Finnegan. Henderson, Farabow, Garrett & Dunner, L.L.P. • Washington. D.C. 



I/We hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be true; and further 
that these statements were made with the knowledge that willful false statements and the like 
so made are punishable by fine or imprisonment, or both, under section 1001 of Title 18 of 
the United States Code, and that such willful false statements may jeopardize the validity of 
the application or any patent issuing thereon. 

IN TESTIMONY WHEREOF, I/We have hereunto set our hands. 



Full Name of First Inventor: 
OaJkoti^Zexiexna,. 


Inventor'sSignature: 


Date: 


Residence: Weesp, The NetherlanC 




Citizenship: 
Dutch 


Post Office Address: 

C.J. van Houtenlaan 36, 1381 CP Weesp. The Netherlands 
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Finnegan, Henderson, Farabow, Garrett & Dunner, L.L.P. • Washington, D.C. 



I/We hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be true; and further 
that these statements were made with the knowledge that willful false statements and the like 
so made are punishable by fine or imprisonment, or both, under section 1001 of Title 18 of 
the United States Code, and that such willful false statements may jeopardize the validity of 
the application pr any patent issuing thereon. 

IN TESTIMONY WHEREOF, I/We have hereunto set our hands. 



Full Name of First Inventor: 
Claudia Beraer 


Inventor's Signature: 


Date: 


Residence: WeespTThe-Netherlanc 

Hannover, Germany 




Citizenship: 
German 


Post Office Address: 

C.J. van Houtenlaan 36, 1381 CP Weesp, The Netherlands 


Full Name of Second Inventor: 
Christiane Loken 


Inventor's Signature: 

^^^^^^^ ^^^^^^^ ^^^^^^^^^ 


Date: 


Residence: Weesp^Th© JSlether4arwis-- 

Hannover, Germany 


Citizenship: 
German 


Post Office Address: 

CJ. van Houtenlaan 36, 1381 CP Weesp, The Netherlands 



wo 01/25269 



PCT/EPOO/09584 



1/28 

SEQUENCE LISTING 

<:110> SOLVAY PHARMACEUTICALS B.V. 

<120> Novel human G-protein coupled Receptor 

<130> SPW99.06/HA 00,19 

<140> 
<141> 

<160> 34 

<170> Patentin Ver. 2.1 

<210> 1 

<211> 1658 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (55) . . (1299) 

<223> IGS4A long version 

<400> 1 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 57 

Met 
1 

tea ggg atg gaa aaa ctt cag aat get tec tgg ate tac eag cag aaa 105 
Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 

5 10 15 

eta gaa gat cea ttc eag aaa eae ctg aac age ace gag gag tat etg 153 
Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
20 25 30 

gee ttc etc tgc gga cet egg cgc age cac ttc ttc etc ccc gtg tct 201 
Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser 
35 40 45 

gtg gtg tat gtg cca att ttt gtg gtg ggg gtc att ggc aat gtc ctg 24 9 
Val Val Tyr Val Pro He Phe Val Val Gly Val He Gly Asn Val Leu 
50 55 60 65 

9tg tgc ctg gtg att ctg cag cac cag get atg aag acg ccc ace aac 297 
Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn 

70 75 80 

tac tac etc ttc age ctg gcg gtc tct gac etc etg gtc ctg etc ctt 345 
Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu 
85 90 95 

gga atg ccc ctg gag gtc tat gag atg tgg cgc aac tac cet ttc ttg 3 93 
Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu 
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100 105 110 

ttc ggg ccc gtg ggc tgc tac ttc aag acg gcc etc ttt gag acc gtg 441 
Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 
115 120 125 



tgc ttc gcc tec ate etc age ate ace acc gtc age gtg gag cge tac 489 
Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr 
130 135 140 145 



gtg gcc ate eta eac ccg ttc egc gcc aaa etg eag age ace egg cge 537 
Val Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 

150 155 160 



egg gcc etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc 585 

Arg Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe 
165 170 175 

tec ctg ccc aac acc age ate cat ggc ate aag ttc cac tac ttc ccc 633 

Ser Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro 
180 185 190 



aat ggg tec ctg gtc cca ggt teg gcc acc tgt acg gtc ate aag ccc 681 
Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro 
195 200 205 



atg tgg ate tac aat ttc ate ate 
Met Trp lie Tyr Asn Phe lie lie 
210 215 

etc etc ccc atg act gtc ate agt 
Leu Leu Pro Met Thr Val He Ser 

230 



cag gtc acc tec ttc eta ttc tac 729 
Gin Val Thr Ser Phe Leu Phe Tyr 
220 225 

gtc etc tac tac etc atg gca etc 777 
Val Leu Tyr Tyr Leu Met Ala Leu 
235 240 



aga eta aag aaa gac aaa tet ctt gag gca gat gaa ggg aat gca aat 825 
Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn 
245 250 255 



att caa aga ccc tgc aga aaa tea gtc aac aag atg etg ttt gtc ttg 873 

He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu 

260 265 270 

gtc tta gtg ttt get ate tgt tgg gee ccg ttc cac att gac ega etc 921 

Val Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu 

275 280 285 



ttc ttc age ttt gtg gag gag tgg agt gaa tec etg get get gtg ttc 969 
Phe Phe Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val Phe 
290 295 300 305 



aac etc gtc cat gtg gtg tea ggt gtc ttc ttc tac ctg age tea get 1017 
Asn Leu Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser Ala 

310 315 320 



gtc aac ccc att ate tat aac eta ctg tet cge cge ttc cag gca gca 
Val Asn pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala 



1065 
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325 330 335 

ttc cag aat gtg ate tct tct ttc cac aaa cag tgg cac tec cag cat 1113 
Phe Gin Asn Val lie Ser Ser Phe His Lys Gin Trp His Ser Gin His 
340 345 350 

gac cca cag ttg cca cct gee cag egg aac ate ttc etg aca gaa tgc 1161 
Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn lie Phe Leu Thr Glu Cys 
355 360 365 

cac ttt gtg gag etg ace gaa gat ata ggt ccc caa ttc cca tgt cag 1209 
His Phe Val Glu Leu Thr Glu Asp lie Gly Pro Gin Phe Pro Cys Gin 
370 375 380 385 

tea tec atg cac aac tct cac etc cca aca gee etc tct agt gaa cag 1257 
Ser Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin 

390 395 400 

atg tea aga aca aac tat caa age ttc cac ttt aac aaa acc 12 99 

Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 415 

tgaattcttt cagagctgac tctcctctat gcctcaaaac ttcagagagg aacatcccat 1359 

aatgtatgcc ttetcatatg atattagaga ggtagaatgg ctcttacaac tcatgtaccc 1419 

attgctagtt tttttttttt aataaacgtg aaaactgaga gttagatctg gtttcaaaac 1479 

ccaagactgc ctgattttta gttatctttc cactatccta actgcctcat gccccttcac 153 9 

tagttcatgc caagaaegtg actggaaagg catggcacct ataccttgat taatttecat 1599 

taatggaaat ggttcgtcct gagtcatcta cgttecgagt caggctgtca ctcctacta 1658 



<210> 2 

<211> 415 
<212> PRT 
<213> Homo sapiens 

<400> 2 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin 
15 10 15 

Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
20 25 30 

Leu Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val 
35 40 45 

Ser Val Val Tyr Val Pro He Phe Val Val Gly Val He Gly Asn Val 
50 55 60 



Leu Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr 
65 70 75 80 
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Asn Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 

85 90 95 

Leu Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe 
100 105 110 

Leu Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 
115 120 125 

Val Cys Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg 
130 135 140 

Tyr Val Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg 
145 150 155 160 

Arg Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu 

165 170 175 

Phe Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe 
180 185 190 

Pro Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys 
195 200 205 

Pro Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe 
210 215 220 

Tyr Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala 
225 230 235 240 

Leu Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 

245 250 255 

Asn He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val 
260 265 270 

Leu Val Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg 
275 280 285 

Leu Phe Phe Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val 
290 295 300 

Phe Asn Leu Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser 
305 310 315 320 

Ala Val Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala 

325 330 335 

Ala Phe Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin 
340 345 350 

His Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu 
355 360 365 



Cys His Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Pro Cys 
370 375 380 
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Gin Ser Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu 
385 390 395 400 

Gin Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 

405 410 415 



<210> 3 
<211> 1658 
<212> DNA 

<213> Homo sapiens ' 

<220> 

<221> CDS 

<222> (64) . . (1299) 

<223> IGS4A short version 

<400> 3 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat tttaatgtca 60 

ggg atg gaa aaa ctt cag aat get tec tgg ate tae eag cag aaa eta 108 
Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu 
15 10 15 

gaa gat eea ttc cag aaa eac ctg aac age acc gag gag tat ctg gee 156 
Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala 

20 25 30 

tte etc tgc gga cct egg cgc age eac ttc tte etc cce gtg tct gtg 2 04 
Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val 
35 40 45 

gtg tat gtg eea att ttt gtg gtg ggg gtc att ggc aat gtc ctg gtg 252 
Val Tyr Val Pro He Phe Val Val Gly Val He Gly Asn Val Leu Val 
50 55 60 

tgc ctg gtg att ctg cag eac cag get atg aag aeg cce acc aac tac 300 
Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr 
65 70 75 

tac etc ttc age ctg gcg gtc tct gac etc ctg gtc ctg etc ctt gga 348 
Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly 
80 85 90 95 

atg cce ctg gag gtc tat gag atg tgg cgc aac tac cct ttc ttg ttc 396 
Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe 

100 105 XIO 

ggg cce gtg ggc tgc tac ttc aag aeg gee etc ttt gag acc gtg tgc 444 
Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys 
115 120 125 



ttc gee tec ate etc age ate ace acc gtc age gtg gag cgc tac gtg 
Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val 



492 
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130 135 140 

gcc ate eta cac ecg ttc cgc gee aaa ctg cag age acc egg cgc egg 
Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 



540 



gee etc agg ate cte ggc ate gtc tgg ggc ttc tec gtg cte ttc tee 588 
Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe Ser 
160 165 170 175 

Ctg ccc aac acc age ate eat ggc ate aag ttc eae tac ttc ccc aat 636 
Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro Asn 

180 185 190 

ggg tec ctg gtc cea ggt teg gee ace tgt aeg gtc ate aag ccc atg 684 
Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro Met 
195 200 205 

tgg ate tac aat ttc ate ate cag gtc acc tec ttc eta ttc tac etc 732 
Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr Leu 
210 215 220 

etc ccc atg act gtc ate agt gtc etc tac tac etc atg gca etc aga 780 
Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 
225 230 235 

eta aag aaa gac aaa tct ctt gag gca gat gaa ggg aat gca aat att 828 
Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn He 
240 245 250 255 

caa aga ccc tgc aga aaa tea gtc aac aag atg ctg ttt gtc ttg gtc 876 
Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val 

260 265 270 

tta gtg ttt get ate tgt tgg gee ecg ttc cac att gac ega etc ttc 924 
Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu Phe 
275 280 285 

ttc age ttt gtg gag gag tgg agt gaa tee ctg get get gtg ttc aac 972 
Phe Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala^ Val Phe Asn 
290 295 300 

cte gtc cat gtg gtg tea ggt gtc ttc ttc tac ctg age tea get gtc 1020 
Leu Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser Ala Val 
305 310 315 

aac ccc att ate tat aac eta ctg tct cgc cgc ttc cag gca gca ttc 1068 
Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala Phe 
320 325 330 335 

cag aat gtg ate tct tct ttc cac aaa cag tgg cac tec cag cat gac 1116 
Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin His Asp 

340 345 350 



cea cag ttg cea cet gee cag egg aac ate ttc ctg aca gaa tgc cac 
Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu Cys His 
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355 360 365 

ttt gtg gag ctg acc gaa gat ata ggt ccc caa ttc cca tgt cag tea 1212 
Phe Val Glu Leu Thr Glu Asp lie Gly Pro Gin Phe Pro Cys Gin Ser 
370 375 380 

tec atg cac aac tct cac etc cca aca gcc etc tet agt gaa cag atg 1260 
Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin Met 
385 390 395 

tea aga aca aac tat caa age ttc cac ttt aac aaa aee tgaattcttt 13 09 
Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
400 405 410 

cagagctgac tctcctctat gcctcaaaac ttcagagagg aacatcccat aatgtatgcc 1369 
ttctcatatg atattagaga ggtagaatgg etcttacaac teatgtacce attgetagtt 1429 
tttttttttt aataaacgtg aaaactgaga gttagatctg gtttcaaaac ccaagactgc 1489 
etgattttta gttatctttc cactatccta actgccteat gccccttcac tagttcatgc 1549 
caagaacgtg actggaaagg catggcacct ataccttgat taatttccat taatggaaat 1609 
ggttcgtcct gagtcatcta cgttccgagt caggctgtca ctcctacta 1658 

<210> 4 

<211> 412 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu Glu 
15 10 15 

Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala Phe 
20 25 30 

Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val Val 
35 40 45 

Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu Val Cys 
50 55 60 

Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr Tyr 
65 70 75 80 

Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly Met 

85 90 95 

Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe Gly 
100 105 110 

Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys Phe 
115 120 125 
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Ala Ser lie Leu Ser 
130 

lie Leu His Pro Phe 
145 

Leu Arg lie Leu Gly 

165 

Pro Asn Thr Ser lie 
180 

Ser Leu Val Pro Gly 
195 

lie Tyr Asn Phe lie 
210 

Pro Met Thr Val lie 
225 

Lys Lys Asp Lys Ser 

245 

Arg Pro Cys Arg Lys 
260 

Val Phe Ala lie Cys 
275 

Ser Phe Val Glu Glu 
290 

Val His Val Val Ser 
305 

Pro lie lie Tyr Asn 

325 

Asn Val lie Ser Ser 
340 

Gin Leu Pro Pro Ala 
355 

Val Glu Leu Thr Glu 
370 

Met His Asn Ser His 
385 

Arg Thr Asn Tyr Gin 

405 



8/28 

He Thr Thr Val Ser Val 
135 

Arg Ala Lys Leu Gin Ser 
150 155 

He Val Trp Gly Phe Ser 

170 

His Gly He Lys Phe His 
185 

Ser Ala Thr Cys Thr Val 
200 

He Gin Val Thr Ser Phe 
215 

Ser Val Leu Tyr Tyr Leu 
230 235 

Leu Glu Ala Asp Glu Gly 

250 

Ser Val Asn Lys Met Leu 
265 

Trp Ala Pro Phe His He 
280 

Trp Ser Glu Ser Leu Ala 
295 

Gly Val Phe Phe Tyr Leu 
310 315 

Leu Leu Ser Arg Arg Phe 

330 

Phe His Lys Gin Trp His 
345 

Gin Arg Asn He Phe Leu 
360 

Asp He Gly Pro Gin Phe 
375 

Leu Pro Thr Ala Leu Ser 
390 395 

Ser Phe His Phe Asn Lys 

410 



Glu Arg Tyr Val Ala 
140 

Thr Arg Axg Arg Ala 

160 

Val Leu Phe Ser Leu 
175 

Tyr Phe Pro Asn Gly 
190 

He Lys Pro Met Trp 
205 

Leu Phe Tyr Leu Leu 
220 

Met Ala Leu Arg Leu 

240 

Asn Ala Asn He Gin 
255 

Phe Val Leu Val Leu 
270 

Asp Arg Leu Phe Phe 
285 

Ala Val Phe Asn Leu 
300 

Ser Ser Ala Val Asn 

320 

Gin Ala Ala Phe Gin 
335 

Ser Gin His Asp Pro 
350 

Thr Glu Cys His Phe 
365 

Pro Cys Gin Ser Ser 
380 

Ser Glu Gin Met Ser 

400 

Thr 
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<210> 5 

<211> 1658 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (55) . . (1299) 

<223> IGS4B long version 

<400> 5 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 57 

Met 
1 

tea ggg atg gaa aaa ctt cag aat get tee tgg ate tac cag cag aaa 105 
Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 

5 10 15 

eta gaa gat eca ttc cag aaa cac ctg aac age ace gag gag tat ctg 153 
Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
20 25 30 

gee ttc etc tgc gga cct egg cgc age cac ttc tte etc ecc gtg tct 201 
Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser 
35 40 45 

9tg gtg tat gtg eca att ttt gtg gtg ggg gtc att ggc aat gtc ctg 249 
Val Val Tyr Val Pro lie Phe Val Val Gly Val He Gly Asn Val Leu 
50 55 60 65 

gtg tgc ctg gtg att ctg cag cac cag get atg aag - aeg ecc ace aac 297 
Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn 

70 75 80 

tac tac etc tte age ctg gcg gtc tct gac etc ctg gtc ctg etc ctt 345 
Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu 
85 90 95 

gga atg ecc ctg gag gtc tat gag atg tgg cgc aac tac cct ttc ttg 393 
Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu 
100 105 110 

ttc ggg ecc gtg ggc tgc tac ttc aag aeg gee etc ttt gag ace gtg 441 
Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 
115 120 125 

tgc ttc gee tec ate etc age ate ace ace gtc age gtg gag cgc tac 489 
Cys Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr 
130 135 140 145 

gtg gee ate eta cac ccg ttc cgc gcc aaa ctg cag age acc egg cgc 537 
Val Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 

150 155 160 
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egg gcc etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc 585 

Arg Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe 
165 170 175 

tec ctg ccc aac acc age ate eat ggc ate aag ttc cac tac ttc cec 633 
Ser Leu Pro Asn Thr Ser lie His Gly He Lys Phe His Tyr Phe Pro 
180 185 190 

aat ggg tec ctg gtc eca ggt teg gee ace tgt acg gtc ate aag ccc 681 
Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro 
195 200 205 

atg tgg ate tac aat ttc ate ate cag gtc acc tec ttc eta ttc tac 729 
Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr 
210 215 220 225 

etc etc ccc atg act gtc ate agt gtc etc tac tac etc atg gca etc 777 
Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu 

230 235 240 

aga eta aag aaa gac aaa tct ett gag gca gat gaa ggg aat gca aat 825 
Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn 
245 250 255 

att caa aga ccc tgc aga aaa tea gtc aac aag atg ctg ttt gtc ttg 873 
He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu 
260 265 270 

gtc tta gtg ttt get ate tgt tgg gcc ceg ttc cac att gac cga etc 921 
Val Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu 
275 280 285 

ttc ttc age ttt gtg gag gag tgg act gaa tec ctg get get gtg ttc 969 
Phe Phe Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val Phe 
290 295 300 305 

aac etc gtc eat gtg gtg tea ggt gtc tta ttc tac ctg age tea get 1017 
Asn Leu Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser Ala 

310 315 320 

gtc aac ccc att ate tat aac eta ctg tct cgc cgc ttc cag gca gca 1065 
Val Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala 
325 330 335 

ttc cag aat gtg ate tct tct ttc cac aaa cag tgg cac tec cag cat 1113 
Phe Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin His 
340 345 350 

gac eca cag ttg eca cct gcc cag egg aac ate ttc ctg aca gaa tgc 1161 
Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu Cys 
355 360 365 

cac ttt gtg gag ctg acc gaa gat ata ggt ccc caa ttc eta tgt cag 1209 
His Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Leu Cys Gin 
370 375 380 385 
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tea tec gtg cac aac tct cac etc cca aca gee etc tet agt gaa eag 1257 
Ser Ser Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin 

390 395 400 

atg tea aga aea aae tat caa age ttc cac ttt aac aaa acc 12 99 

Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 415 

tgaattcttt cagagctgac tctcctctat gcctcaaaac ttcagagagg aacatcccat 1359 

aatgtatgce ttcteatatg aaattagaga ggtagaatgg ctcttacaac tcatgtaccc 1419 

attgctagtt tttttttttt aataaacgtg aaaactgaga gttagatctg gtttcaaaac 1479 

ccaagactgc ctgattttta gttatctttc cactatccta actgcctcat gccccttcac 153 9 

tagttcatgc caagaacgtg actggaaagg eatggcacct ataecttgat taatttecat 15 99 

taatggaaat ggttcgtcct gagtcatcta cgttccgagt caggctgtca ctcctacta 1658 

<210> 6 

<211> 415 

<212> PRT 

<213> Homo sapiens 

<:400> 6 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp He Tyr Gin Gin 
15 10 15 

Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
20 25 30 

Leu Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val 
35 40 45 

Ser Val Val Tyr Val Pro He Phe Val Val Gly Val He Gly Asn Val 
50 55 60 

Leu Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr 
65 70 75 80 

Asn Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 

85 90 95 

Leu Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe 
100 105 110 

Leu Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 
115 120 125 

Val Cys Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg 
130 135 140 

Tyr Val Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg 
145 150 155 160 
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Arg Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu 

165 170 175 

Phe Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe 
180 185 190 

Pro Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys 
195 200 205 

Pro Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe 
210 215 220 

Tyr Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala 
225 230 235 240 

Leu Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 

245 250 255 

Asn He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val 
260 265 270 

Leu Val Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg 
275 280 285 

Leu Phe Phe Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val 
290 295 300 

Phe Asn Leu Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser 
305 310 315 320 

Ala Val Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala 

325 330 335 

Ala Phe Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin 
340 345 350 

His Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu 
355 360 365 

Cys His Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Leu Cys 
370 375 380 

Gin Ser Ser Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu 
385 390 395 400 

Gin Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 

405 410 415 



<210> 7 

<211> 1658 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> CDS 

<222> (64) . . (1299) 

<223> IGS4B short version 

<400> 7 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat tttaatgtca 60 

ggg atg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa eta 108 
Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Isys Leu 
15 10 15 

gaa gat cca ttc cag aaa eac ctg aac age acc gag gag tat ctg gcc 156 
Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala 

20 25 30 

ttc etc tgc gga cct egg ege age eac tte tte etc cec gtg tet gtg 204 
Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val 
35 40 45 

gtg tat gtg cea att ttt gtg gtg ggg gtc att ggc aat gtc ctg gtg 252 
Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu Val 
50 55 60 

tgc ctg gtg att ctg cag cac cag get atg aag aeg cec ace aac tac 3 00 
Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr 
65 70 75 

tac etc tte age ctg gcg gtc tet gae etc ctg gtc ctg etc ctt gga 348 
Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly 
80 85 90 95 

atg cec ctg gag gtc tat gag atg tgg cgc aac tac cct ttc ttg tte 3 96 
Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe 

100 105 110 

ggg cec gtg ggc tgc tac ttc aag acg gcc etc ttt gag acc gtg tgc 444 
Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys 
115 120 125 

ttc gee tec ate etc age ate acc acc gtc age gtg gag cgc tac gtg 4 92 
Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr Val 
130 135 140 

gee ate eta eac ccg tte cgc gee aaa ctg cag age acc egg cgc egg 54 0 
Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 

gee etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc tec 588 
Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe Ser 
160 165 170 175 



ctg cec aac ace age ate cat ggc ate aag ttc eac tac tte cec aat 
Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro Asn 

180 185 190 



636 
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ggg tec ctg gtc cca ggt teg gcc acc tgt acg gtc ate aag ccc atg 684 
Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro Met 
195 200 205 

tgg ate tac aat ttc ate ate cag gtc aec tec ttc eta ttc tac etc 732 
Trp lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe Tyr Leu 
210 215 220 

etc ccc atg act gtc ate agt gtc etc tac tac etc atg gca etc aga 780 
Leu Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 
225 230 235 

eta aag aaa gac aaa tet ett gag gca gat gaa ggg aat gca aat att 82 8 
Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn lie 
240 245 250 255 

caa aga ccc tgc aga aaa tea gtc aac aag atg ctg ttt gtc ttg gtc 876 
Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val 

260 265 270 

tta gtg ttt get ate tgt tgg gcc ccg ttc cac att gac cga etc ttc 924 
Leu Val Phe Ala lie Cys Trp Ala Pro Phe His lie Asp Arg Leu Phe 
275 280 285 

ttc age ttt gtg gag gag tgg act gaa tec ctg get get gtg ttc aac 972 
Phe Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val Phe Asn 
290 295 300 

etc gtc eat gtg gtg tea ggt gtc tta ttc tac ctg age tea get gtc 1020 
Leu Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser Ala Val 
305 310 315 

aac ccc att ate tat aac eta ctg tet egc cgc ttc cag gca gca ttc 1068 
Asn Pro lie He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala Phe 
320 325 330 335 

cag aat gtg ate tet tet ttc cac aaa cag tgg cac tec cag eat gac 1116 
Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin His Asp 

340 345 350 

cea cag ttg cca ect gee cag egg aac ate ttc ctg aca gaa tgc cac 1164 
Pro Gin Leu Pro Pro Ala Gin T^g Asn He Phe Leu Thr Glu Cys His 
355 360 365 

ttt gtg gag ctg ace gaa gat ata ggt ccc caa ttc eta tgt cag tea 1212 
Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Leu Cys Gin Ser 
370 375 380 

tec gtg cac aac tet cac etc cca aca gcc etc tet agt gaa cag atg 1260 
Ser Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin Met 
385 390 395 

tea aga aca aac tat caa age ttc cac ttt aac aaa ace tgaattcttt 1309 
Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
400 405 410 
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cagagctgac tctcctctat gcctcaaaac ttcagagagg aacatcccat aatgtatgcc 1369 
ttctcatatg aaattagaga ggtagaatgg ctcttacaac tcatgtaccc attgctagtt 142 9 
tttttttttt aataaacgtg aaaactgaga gttagatctg gtttcaaaac ccaagactgc 14 89 
ctgattttta gttatctttc cactatccta actgcctcat gccccttcac tagttcatgc 1549 
caagaacgtg actggaaagg catggcacct ataccttgat taatttccat taatggaaat 1609 
ggttcgtcct gagtcatcta cgttccgagt caggctgtca ctcctacta 1658 

<210> 8 

<211> 412 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Met Glu Ijys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu Glu 
15 10 15 

Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala Phe 
20 25 30 

Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val Val 
35 40 45 

Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu Val Cys 
50 55 60 

Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr Tyr 
65 70 75 80 

Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly Met 

85 90 95 

Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe Gly 
100 105 110 

Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys Phe 
115 120 125 

Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val Ala 
130 135 140 

He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg Ala 
145 150 155 160 

Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser Leu 

165 170 175 

Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro Asn Gly 
180 185 190 

Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro Met Trp 
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lie Tyr Asn Phe lie 
210 

Pro Met Thr Val lie 
225 

Lys Lys Asp Lys Ser 

245 

Arg Pro Cys Arg Lys 
260 

Val Phe Ala lie Cys 
275 

Ser Phe Val Glu Glu 
290 

Val His Val Val Ser 
305 

Pro lie lie Tyr Asn 

325 

Asn Val lie Ser Ser 
340 

Gin Leu Pro Pro Ala 
355 

Val Glu Leu Thr Glu 
370 

Val His Asn Ser His 
385 

Arg Thr Asn Tyr Gin 

405 
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200 

lie Gin Val Thr Ser Phe 
215 

Ser Val Leu Tyr Tyr Leu 
230 235 

Leu Glu Ala Asp Glu Gly 

250 

Ser Val Asn Lys Met Leu 
265 

Trp Ala Pro Phe His lie 
280 

Trp Thr Glu Ser Leu Ala 
295 

Gly Val Leu Phe Tyr Leu 
310 315 

Leu Leu Ser Arg Arg Phe 

330 

Phe His Lys Gin Trp His 
345 

Gin Arg Asn lie Phe Leu 
360 

Asp He Gly Pro Gin Phe 
375 

Leu Pro Thr Ala Leu Ser 
390 395 

Ser Phe His Phe Asn Lys 

410 



205 

Leu Phe Tyr Leu Leu 
220 

Met Ala Leu Arg Leu 

240 

Asn Ala Asn He Gin 
255 

Phe Val Leu Val Leu 
270 

Asp Arg Leu Phe Phe 
285 

Ala Val Phe Asn Leu 
300 

Ser Ser Ala Val Asn 

320 

Gin Ala Ala Phe Gin 
335 

Ser Gin His Asp Pro 
350 

Thr Glu Cys His Phe 
365 

Leu Cys Gin Ser Ser 
380 

Ser Glu Gin Met Ser 

400 

Thr 



<210> 9 
<211> 1594 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (55) . . (942) 

<223> IGS4A truncated DNA long version 
<400> 9 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 57 

Met 
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tea ggg atg gaa aaa ctt cag aat get tec tgg ate tae cag cag aaa 105 
Ser Gly Met. Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 

5 10 15 

eta gaa gat eea ttc eag aaa cac ctg aae age ace gag gag tat ctg 153 
Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
20 25 30 

gee ttc etc tgc gga eet egg ege age cac ttc ttc etc cce gtg tct 201 
Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser 
35 40 45 

gtg gtg tat gtg eea att ttt gtg gtg ggg gte att gge aat gte ctg 249 
Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu 
50 55 60 65 

Qtg tgc ctg gtg att ctg cag cac cag get atg aag aeg ccc ace aae 2 97 
Val Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn 

70 75 80 

tae tac etc ttc age ctg gcg gte tct gac etc ctg gte ctg etc ctt 345 
Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu 
85 90 95 

gga atg ccc ctg gag gte tat gag atg tgg cgc aae tae eet ttc ttg 3 93 
Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu 
100 105 110 

ttc ggg ccc gtg gge tgc tac ttc aag aeg gee etc ttt gag ace gtg 441 
Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 
115 120 125 

tgc ttc gee tec ate etc age ate ace ace gte age gtg gag cgc tac 489 
Cys Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr 
130 135 140 145 

gtg gee ate eta cac ccg ttc cgc gee aaa ctg cag age ace egg cgc 537 
Val Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 

150 155 160 

egg gee etc agg ate etc gge ate gte tgg gge ttc tec gtg etc ttc 585 
Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe 
165 170 175 

tec ctg ccc aac ace age ate cat gge ate aag ttc cac tac ttc ccc 633 
Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro 
180 185 190 

aat ggg tec ctg gte eea ggt teg gcc ace tgt aeg gte ate aag cce 681 
Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro 
195 200 205 



atg tgg ate tae aat ttc ate ate cag gte acc tec ttc eta ttc tae 
Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr 



729 
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210 215 

etc etc ccc atg act gtc ate 
Leu Leu Pro Met Thr Val lie 

230 

aga eta aag aaa gac aaa tct 
Arg Leu Lys Lys Asp Lys Ser 
245 

att caa aga ccc tgc aga aaa 
lie Gin Arg Pro Cys Arg Lys 
260 

agg agt gga gtg aat ccc tgg 
Arg Ser Gly Val Asn Pro Trp 
275 280 

tgt cag gtg tct tct tct acc 
Cys Gin Val Ser Ser Ser Thr 
290 295 



acctactgtc 


tcgccgcttc 


caggcagcat 


tccagaatgt 


gatctcttct 


ttccacaaac 


1032 


agtggcactc 


ccagcatgac 


ccacagttgc 


cacctgccca 


gcggaacatc 


ttcctgacag 


1092 


aatgccactt 


tgtggagctg 


accgaagata 


taggtcccca 


attcccatgt 


cagtcatcca 


1152 


tgcacaactc 


tcacctccca 


acagccctct 


ctagtgaaca 


gatgtcaaga 


acaaactatc 


1212 


aaagcttcca 


ctttaacaaa 


acctgaattc 


tttcagagct 


gactctcctc 


tatgcctcaa 


1272 


aacttcagag 


aggaacatcc 


cataatgtat 


gccttctcat 


atgatattag 


^g^ggtagaa 


1332 


tggctcttac 


aactcatgta 


cccattgeta 


gttttttttt 


tttaataaac 


gtgaaaactg 


1392 


agagttagat 


ctggtttcaa 


aacccaagac 


tgcctgattt 


ttagttatct 


ttccactatc 


1452 


ctaactgcct 


catgcccctt 


cactagttca 


tgccaagaac 


gtgactggaa 


aggcatggca 


1512 


cctatacctt 


gattaatttc 


cattaatgga 


aatggttcgt 


cctgagtcat 


ctacgttccg 


1572 


agtcaggctg 


tcactcctac 


ta 








1594 



<210> 10 

<211> 296 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin 
15 10 15 



220 225 

agt gtc etc tac tac etc atg gca etc 777 
Ser Val Leu Tyr Tyr Leu Met Ala Leu 
235 240 

ctt gag gca gat gaa ggg aat gca aat 825 
Leu Glu Ala Asp Glu Gly Asn Ala Asn 
250 255 

tea gtc aac aag atg ctg tct ttg tgg 873 
Ser Val Asn Lys Met Leu Ser Leu Trp 
265 270 

ctg ctg tgt tea acc teg tec atg tgg 921 
Leu Leu Cys Ser Thr Ser Ser Met Trp 

285 

tgagctcagc tgtcaacccc attatctata 972 



Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
20 25 30 
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Leu Ala Phe Leu 
35 

Ser Val Val Tyr 
50 

Leu Val Cys Leu 
65 

Asn Tyr Tyr Leu 



Leu Gly Met Pro 
100 

Leu Phe Gly Pro 
115 

Val Cys Phe Ala 
130 

Tyr Val Ala lie 
145 

Arg Arg Ala Leu 



Phe Ser Leu Pro 
180 

Pro Asn Gly Ser 
195 

Pro Met Trp lie 
210 

Tyr Leu Leu Pro 
225 

Leu Arg Leu Lys 



Asn lie Gin Arg 
260 

Trp Arg Ser Gly 
275 

Tarp Cys Gin Val 
290 



Cys Gly Pro Arg 
40 

Val Pro lie Phe 
55 

Val lie Leu Gin 
70 

Phe Ser Leu Ala 
85 

Leu Glu Val Tyr 



Val Gly Cys Tyr 
12 0 

Ser lie Leu Ser 
135 

Leu His Pro Phe 
150 

Arg lie Leu Gly 
165 

Asn Thr Ser lie 



Leu Val Pro Gly 
200 

Tyr Asn Phe lie 
215 

Met Thr Val lie 
230 

Lys Asp Lys Ser 
245 

Pro Cys Arg Lys 



Val Asn Pro Trp 
280 

Ser Ser Ser Thr 
295 



19/28 

Arg Ser His Phe 



Val Val Gly Val 
60 

His Gin Ala Met 
75 

Val Ser Asp Leu 
90 

Glu Met Trp Arg 
105 

Phe Lys Thr Ala 



He Thr Thr Val 
140 

Arg Ala Lys Leu 
155 

He Val Trp Gly 
170 

His Gly He Lys 
185 

Ser Ala Thr Cys 



He Gin Val Thr 
220 

Ser Val Leu Tyr 
235 

Leu Glu Ala Asp 
250 

Ser Val Asn Lys 
265 

Leu Leu Cys Ser 



Phe Leu Pro Val 
45 

He Gly Asn Val 



Lys Thr Pro Thr 
80 

Leu Val Leu Leu 
95 

Asn Tyr Pro Phe 
110 

Leu Phe Glu Thr 
125 

Ser Val Glu Arg 



Gin Ser Thr Arg 
160 

Phe Ser Val Leu 
175 

Phe His Tyr Phe 
190 

Thr Val He Lys 
205 

Ser Phe Leu Phe 



Tyr Leu Met Ala 
240 

Glu Gly Asn Ala 
255 

Met Leu Ser Leu 
270 

Thr Ser Ser Met 
285 



<210> 11 
<211> 1594 
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<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (64) . . (942) 

<223> IGS4A truncated DNA short version 
<400> 11 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat tttaatgtca 60 

ggg atg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa eta 108 
Met Glu Iiys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu 
15 10 15 

gaa gat cca ttc cag aaa cac ctg aac age ace gag gag tat ctg gee 156 
Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala 

20 25 30 

ttc etc tgc gga cct egg cgc age cac ttc ttc etc ccc gtg tct gtg 204 
Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val 
35 40 45 

gtg tat gtg cca att ttt gtg gtg ggg gtc att ggc aat gtc ctg gtg 252 
Val Tyr Val Pro He Phe Val Val Gly Val He Gly Asn Val Leu Val 
50 55 60 

tgc ctg gtg att ctg cag cac cag get atg aag acg ccc acc aac tac 3 00 
Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr 
65 70 75 

tac etc ttc age ctg gcg gtc tct gac etc ctg gtc ctg etc ctt gga 348 
Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly 
80 85 90 95 

atg ccc ctg gag gtc tat gag atg tgg cgc aac tac cct ttc ttg ttc 396 
Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe 

100 105 110 

ggg ccc gtg ggc tgc tac ttc aag acg gee etc ttt gag acc gtg tgc 444 
, Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys 
115 120 125 

ttc gee tec ate etc age ate acc ace gtc age gtg gag cgc tac gtg 492 
Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val 
130 135 140 

gee ate eta cac ccg ttc cgc gcc aaa ctg cag age ace egg cgc egg 540 
Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 

gcc etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc tec 588 
Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser 
160 165 170 175 



ctg ccc aac acc age ate cat ggc ate aag ttc cac tac ttc ccc aat 



636 



wo 01/25269 



PCT/EPOO/09584 



21/28 

Leu Pro Asn Thr Ser lie 

180 

ggg tec ctg gtc cca ggt 
Gly Ser Leu Val Pro Gly 
195 

tgg ate tac aat ttc ate 
Trp lie Tyr Asn Phe lie 
210 

etc cce atg act gtc ate 
Leu Pro Met Thr Val lie 
225 

eta aag aaa gae aaa tct 
Leu Lys Lys Asp Lys Ser 
240 245 

eaa aga cce tgc aga aaa 
Gin Arg Pro Cys Arg Lys 

260 

agt gga gtg aat cce tgg 
Ser Gly Val Asn Pro Trp 
275 

cag gtg tct tct tct acc 
Gin Val Ser Ser Ser Thr 
290 

acctactgtc tcgccgcttc caggcagcat tccagaatgt gatctcttct ttccacaaac 1032 
agtggcaete ccagcatgac ccacagttge cacctgccca geggaacatc ttcctgacag 1092 
aatgccactt tgtggagctg accgaagata taggtcccca attcccatgt cagtcatcca 1152 
tgcacaactc tcacctccca acagccctct ctagtgaaca gatgtcaaga acaaactatc 1212 
aaagcttcca ctttaacaaa acctgaattc tttcagagct gactctcetc tatgcctcaa 1272 
aacttcagag aggaacatcc cataatgtat gccttctcat atgatattag agaggtagaa 1332 
tggctettac aacteatgta cecattgeta gttttttttt tttaataaac gtgaaaaetg 1392 
agagttagat ctggtttcaa aacccaagac tgcctgattt ttagttatct ttccactatc 1452 
etaactgect catgcecctt cactagttea tgccaagaac gtgactggaa aggcatggea 1512 
cctatacctt gattaatttc cattaatgga aatggttcgt cctgagtcat ctacgttccg 1572 
agtcaggctg tcactcctac ta 1594 



<210> 12 
<211> 293 



His Gly lie Lys Phe His Tyr Phe Pro Asn 
185 190 

teg gee acc tgt acg gtc ate aag cce atg 684 
Ser Ala Thr Cys Thr Val lie Lys Pro Met 
200 205 

ate cag gtc ace tec ttc eta ttc tac etc 732 
lie Gin Val Thr Ser Phe Leu Phe Tyr Leu 
215 220 

agt gtc etc tac tac etc atg gea etc aga 780 
Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 
230 235 

ctt gag gea gat gaa ggg aat gea aat att 82 8 
Leu Glu Ala Asp Glu Gly Asn Ala Asn lie 

250 255 

tea gtc aac aag atg ctg tct ttg tgg agg 876 
Ser Val Asn Lys Met Leu Ser Leu Trp Arg 
265 270 

ctg ctg tgt tea acc teg tee atg tgg tgt 924 
Leu Leu Cys Ser Thr Ser Ser Met Trp Cys 
280 285 

tgagctcagc tgtcaacccc attatctata 972 
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<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu Glu 
15 10 15 

Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala Phe 
20 25 30 

Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val Val 
35 40 45 

Tyr Val Pro lie Phe Val Val Gly Val He Gly Asn Val Leu Val Cys 
50 55 60 

Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr Tyr 
65 70 75 80 

Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly Met 

85 90 95 

Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe Gly 
100 105 110 

Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys Phe 
115 120 125 

Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val Ala 
130 135 140 

He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg Ala 
145 150 155 160 

Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser Leu 

165 170 175 

Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro Asn Gly 
180 185 190 

Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro Met Trp 
195 200 205 

He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr Leu Leu 
210 215 220 

Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg Leu 
225 230 235 240 

Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn He Gin 

245 250 255 

Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Ser Leu Trp Arg Ser 
260 265 270 



Gly Val Asn Pro Trp Leu Leu Cys Ser Thr Ser Ser Met Trp Cys Gin 



V -I .J"" » I, -t. " 4 V «) ai ) * 
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275 



280 



285 



Val Ser Ser Ser Thr 
290 



<210> 13 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
primers 

<220> 

<221> variation 

<222> (21) 

<223> A,C,G or T 

<220> 

<221> variation 

<222> (24) 

<223> A, C, G or T 



Degenerated 



<400> 13 

ctcatcttcg cggtgggcrc ngyngg 



26 



<210> 14 
<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Degenerated 
primers 

<220> 

<221> variation 

<222> (22) 

<22 3> C or Inosine 

<220> 

<221> variation 

<222> (25) 

<223> A, C, G or T 



<220> 

<221> variation 

<222> (28) 

<223> A, C, G or T 



<400> 14 

ggccaggcag cgctccgcgc tnarncyngc d 



31 
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<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerated 
primers 

<400> 15 

gaartartag ccrcgrcagc cw 

<210> 16 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 16 

ccatcctaat acgactcact atagggc 

<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 17 

actcactata gggctcgagc ggc 

<210> 18 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

ggatcccaaa taagaaaggg tagttgc 

<210> 19 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 19 

aaagggtagt tgcgccacat ctcatagac 

<210> 20 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 20 

aggtctatga gatgtggcgc aactaccct 

<210> 21 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 

<400> 21 

atgtggcgca actacccttt cttatttggg 

<210> 22 

<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerated 
primers 

<400> 22 

cggaagttgg cggacacgrv rttrta 

<210> 23 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 23 

gctcagcttg aaacagagcc tcgtacc 
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<210> 24 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 24 

ccatgtggat ctacaatttc atcatcc 

<210> 25 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 25 

aagacaaatc tcttgaggca gatgaaggg 

<210> 26 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 26 

gatgctgttt gtcttggtct tagtgtttgc 

<210> 27 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 27 

ggatgatgaa attgtagatc cacatgggc 

<210> 28 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400:> 28 

tgtggagaag tctctcaaag tgtgg 



25 



<210> 29 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 29 

tagtaggagt gacagcctga ctcggaacg 2 9 

<210> 30 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 31 
<211> 22 
<212> DNA 

<213> Artificial Sec[uence 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 31 

tcgtaccagg ggaggctcag gc 22 

<210> 32 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 



<400> 30 

aacgtagatg actcaggacg aaccatttcc 



30 



<400> 32 

cctcttcagc ctggcggtct ctg 



23 



<210> 33 
<211> 22 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 33 

ggaggcgaag cacacggtct ca 22 



<210> 34 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<220> 

<221> misc_binding 
<222> (1) 

<223> Labeled with 6-carboxyf luorescein 
<220> 

<221> misc_binding 

<222> (34) 

<223> Labeled with 

N,N,N' ,N' -tetramethyl-6-carboxyrhodamin 



<400> 34 

agatgtggcg caactaccct ttcttgttcg ggcc 



34 



WVfCr/PTO 22 JUL2002 



SEQUENCE LISTING 

<110> SOIiVAY PHARMACEUTICALS B.V. 

<12 0> Novel human G-protein coupled Receptor 

<130> SPW99.06/HA 00.19 

<140> 
<141> 

<160> 34 

<170> Patentin Ver. 2.1 

<210> 1 

<211> 1658 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (55) . . (1299) 

<223> IGS4A long version 

<400> 1 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 5 7 

Met 
1 

tea ggg atg gaa aaa ctt cag aat get tec tgg ate tae cag eag aaa 105 
Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 
5 10 15 

eta gaa gat eca ttc cag aaa cac ctg aac age ace gag gag tat ctg 153 
Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
20 25 30 

gee ttc etc tgc gga cct egg cge age cac ttc ttc etc cee gtg tet 2 01 
Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser 
35 40 45 

gtg gtg tat gtg eca att ttt gtg gtg ggg gtc att ggc aat gtc ctg 249 
Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu 
50 55 60 65 

gtg tgc ctg gtg att ctg cag cac eag get atg aag aeg cec ace aac 2 97 
Val Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn 

70 75 80 

tae tae etc ttc age ctg geg gtc tet gae etc ctg gtc ctg etc ctt 345 
Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu 
85 90 95 



gga atg cec ctg gag gtc tat gag atg tgg cge aac tae cct ttc ttg 
Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu 
100 105 110 



393 
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ttc ggg ccc gtg ggc tgc tac ttc aag acg gcc etc ttt gag acc gtg 441 

Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 
115 120 125 

tgc ttc gcc tec ate etc age ate ace acc gtc age gtg gag cgc tac 489 

Cys Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr 

130 135 140 145 

gtg gee ate eta cac ccg ttc cgc gcc aaa ctg eag age acc egg cgc 537 

Val Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 

150 155 160 

egg gcc etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc 585 

Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe 
165 170 175 

tec ctg ccc aac acc age ate cat ggc ate aag ttc cac tac ttc ccc 63 3 

Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro 

180 185 190 

aat ggg tec ctg gtc eca ggt teg gee acc tgt acg gtc ate aag ccc 681 

Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro 
195 200 205 

atg tgg ate tac aat ttc ate ate cag gtc ace tec ttc eta ttc tac 729 

Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr 

210 215 220 225 

etc etc ccc atg act gtc ate agt gtc etc tac tac etc atg gca etc 777 

Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu 

230 235 240 

aga eta aag aaa gac aaa tct ctt gag gca gat gaa ggg aat gca aat 825 

Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn 
245 250 255 

att caa aga ccc tgc aga aaa tea gtc aac aag atg ctg ttt gtc ttg 873 

He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu 

260 265 270 

gtc tta gtg ttt get ate tgt tgg gcc ccg ttc cac att gac cga etc 921 

Val Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu 
275 280 285 

ttc ttc age ttt gtg gag gag tgg agt gaa tec ctg get get gtg ttc 969 

Phe Phe Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val Phe 

290 295 300 305 

aac etc gtc cat gtg gtg tea ggt gtc ttc ttc tac ctg age tea get 1017 

Asn Leu Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser Ala 

310 315 320 

gtc aac ccc att ate tat aac eta ctg tct cgc cgc ttc eag gca gca 1065 

Val Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala 
325 330 335 



ttc cag aat gtg ate tct tct ttc cac aaa cag tgg cac tec eag cat 
Phe Gin Asn Val lie Ser Ser Phe His Lys Gin Trp His Ser Gin His 
340 345 350 



1113 



gac cea eag ttg cca cct gee eag egg aac ate ttc ctg aca gaa tgc 1161 
Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn lie Phe Leu Thr Glu Cys 
355 360 365 

cac ttt gtg gag ctg acc gaa gat ata ggt ecc caa ttc cca tgt cag 1209 
His Phe Val Glu Leu Thr Glu Asp lie Gly Pro Gin Phe Pro Cys Gin 
370 375 380 385 

tea tec atg cac aac tct cac etc cea aca gee etc tct agt gaa cag 1257 
Ser Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin 

390 395 400 

atg tea aga aca aac tat caa age ttc cac ttt aac aaa acc 12 99 

Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 415 

tgaattcttt cagagctgac tctcctctat gcctcaaaac ttcagagagg aacatcccat 1359 

aatgtatgcc ttctcatatg atattagaga ggtagaatgg ctcttacaac tcatgtaccc 1419 

attgctagtt tttttttttt aataaacgtg aaaaetgaga gttagatctg gtttcaaaac 1479 

ccaagactgc ctgattttta gttatctttc cactatccta actgcctcat gceecttcac 1539 

tagttcatgc caagaacgtg actggaaagg catggcacct ataccttgat taatttccat 1599 

taatggaaat ggttcgtcct gagtcatcta cgttcegagt caggctgtca etcctacta 1658 



<210> 2 

<211> 415 

<212> PRT 

<213> Homo sapiens 



<400> 2 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin 
15 10 15 

Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
20 25 30 

Leu Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val 
35 40 45 

Ser Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val 
50 55 60 

Leu Val Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr 
65 70 75 80 



Asn Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 

85 90 95 



Leu Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe 
100 105 110 



Leu Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 
115 120 125 

Val Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg 
130 135 140 

Tyr Val Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg 
145 150 155 160 

Arg Arg Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu 

165 170 175 

Phe Ser Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe 
180 185 190 

Pro Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys 
195 200 205 

Pro Met Trp lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe 
210 215 220 

Tyr Leu Leu Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala 
225 230 235 240 

Leu Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 

245 250 255 

Asn lie Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val 
260 265 270 

Leu Val Leu Val Phe Ala lie Cys Trp Ala Pro Phe His lie Asp Arg 
275 280 285 

Leu Phe Phe Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val 
290 295 300 

Phe Asn Leu Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser 
305 310 315 320 

Ala Val Asn Pro lie lie Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala 

325 330 335 

Ala Phe Gin Asn Val lie Ser Ser Phe His Lys Gin Trp His Ser Gin 
340 345 350 

His Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn lie Phe Leu Thr Glu 
355 360 365 

Cys His Phe Val Glu Leu Thr Glu Asp lie Gly Pro Gin Phe Pro Cys 
370 375 380 



Gin Ser Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu 
385 390 395 400 



Gin Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 

405 410 415 



<210> 3 

<211> 1658 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (64) . . (1299) 

<223> IGS4A short version 

<400> 3 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat tttaatgtca 60 

ggg atg gaa aaa ctt cag aat get tec tgg ate tac eag cag aaa eta 108 
Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu 
15 10 15 

gaa gat cea tte cag aaa eac ctg aac age ace gag gag tat ctg gee 156 
Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala 

20 25 30 

tte etc tge gga cet egg egc age eac tte tte etc ccc gtg tet gtg 204 
Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val 
35 40 45 

gtg tat gtg cca att ttt gtg gtg ggg gtc att ggc aat gtc ctg gtg 252 
Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu Val 
50 55 60 

tge ctg gtg att ctg cag cac cag get atg aag acg ccc acc aac tac 3 00 
Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr 
65 70 75 

tac etc tte age ctg gcg gtc tct gac etc ctg gtc ctg etc ctt gga 348 
Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly 
80 85 90 95 

atg ccc ctg gag gtc tat gag atg tgg egc aac tac cct tte ttg tte 396 
Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe 

100 105 110 

ggg ccc gtg ggc tge tac tte aag acg gee etc ttt gag ace gtg tge 444 
Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys 
115 120 125 

tte gee tee ate etc age ate acc acc gtc age gtg gag egc tac gtg 492 
Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr Val 
130 135 140 



gcc ate eta cac ccg tte egc gee aaa ctg cag age acc egg egc egg 
Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 



540 
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gcc etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc tec 588 
Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe Ser 
160 165 170 175 

etg cec aac ace age ate eat ggc ate aag ttc cac tac ttc ecc aat 63 6 
Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro Asn 

180 185 190 

ggg tec etg gtc eca ggt teg gee ace tgt aeg gtc ate aag ecc atg 6 84 
Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro Met 
195 200 205 

tgg ate tac aat ttc ate ate eag gtc ace tee ttc eta ttc tac etc 732 
Trp lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe Tyr Leu 
210 215 220 

etc ecc atg act gtc ate agt gtc etc tac tac etc atg gca etc aga 780 
Leu Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 
225 230 235 

eta aag aaa gac aaa tct ctt gag gca gat gaa ggg aat gca aat att 82 8 
Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn lie 
240 245 250 255 

caa aga ecc tgc aga aaa tea gtc aac aag atg etg ttt gtc ttg gtc 876 
Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val 

260 265 270 

tta gtg ttt get ate tgt tgg gcc ccg ttc cac att gac cga etc ttc 924 
Leu Val Phe Ala lie Cys Trp Ala Pro Phe His lie Asp Arg Leu Phe 
275 280 285 

ttc age ttt gtg gag gag tgg agt gaa tec etg get get gtg ttc aac 972 
Phe Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val Phe Asn 
290 295 300 

etc gtc cat gtg gtg tea ggt gtc ttc ttc tac etg age tea get gtc 1020 
Leu Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser Ala Val 
305 310 315 

aac ecc att ate tat aac eta etg tct cgc cgc ttc eag gca gca ttc 1068 
Asn Pro lie lie Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala Phe 
320 325 330 335 

eag aat gtg ate tct tct ttc cac aaa cag tgg cac tec cag cat gac 1116 
Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin His Asp 

340 345 350 

eca cag ttg cca ect gcc cag egg aac ate ttc etg aca gaa tgc cac 1164 
Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu Cys His 
355 360 365 

ttt gtg gag etg ace gaa gat ata ggt ecc caa ttc cca tgt cag tea 1212 
Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Pro Cys Gin Ser 
370 375 380 



-I' ^^p^ i'P'^ J'*^' 



tec atg cac aac tct cac etc cea aea gee etc tct agt gaa cag atg 1260 
Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin Met 
385 390 395 

tea aga aea aac tat caa age ttc cac ttt aac aaa aec tgaattcttt 1309 
Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
400 405 410 

cagagctgac tctectctat gcctcaaaac ttcagagagg aaeatcccat aatgtatgcc 1369 
ttctcatatg atattagaga ggtagaatgg ctcttacaac tcatgtaccc attgctagtt 142 9 
tttttttttt aataaaegtg aaaactgaga gttagatctg gtttcaaaac ccaagactge 1489 
ctgattttta gttatctttc cactatccta aetgcctcat gccccttcac tagttcatgc 1549 
caagaaegtg aetggaaagg catggcacct ataccttgat taatttccat taatggaaat 1609 
ggttcgtcct gagtcatcta cgttccgagt caggctgtca ctcctacta 1658 



<210> 4 
<211> 412 
<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu Glu 
15 10 15 

Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala Phe 
20 25 30 

Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val Val 
35 40 45 

Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu Val Cys 
50 55 60 

Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr Tyr 
65 70 75 80 

Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly Met 

85 90 95 

Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe Gly 
100 105 110 

Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys Phe 
115 120 125 

Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr Val Ala 
130 135 140 

lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg Ala 
145 150 155 160 
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Leu Arg lie Leu 



Pro Asn Thr Ser 
180 

Ser Leu Val Pro 
195 

lie Tyr Asn Phe 
210 

Pro Met Thr Val 
225 



Gly lie Val Trp 
165 

lie His Gly lie 



Gly Ser Ala Thr 
200 

lie lie Gin Val 
215 

lie Ser Val Leu 
230 



Gly Phe Ser Val 
170 

Lys Phe His Tyr 
185 

Cys Thr Val He 



Thr Ser Phe Leu 
220 

Tyr Tyr Leu Met 
235 



Leu Phe Ser Leu 
175 

Phe Pro Asn Gly 
190 

Lys Pro Met Trp 
205 

Phe Tyr Leu Leu 



Ala Leu Arg Leu 
240 



Lys Lys Asp Lys 



Arg Pro Cys Arg 
260 

Val Phe Ala He 
275 

Ser Phe Val Glu 
290 

Val His Val Val 
305 

Pro He He Tyr 



Asn Val He Ser 
340 

Gin Leu Pro Pro 
355 

Val Glu Leu Thr 
370 

Met His Asn Ser 
385 

Arg Thr Asn Tyr 



Ser Leu Glu Ala 
245 

Lys Ser Val Asn 



Cys Trp Ala Pro 
280 

Glu Trp Ser Glu 
295 

Ser Gly Val Phe 
310 

Asn Leu Leu Ser 
325 

Ser Phe His Lys 



Ala Gin Arg Asn 
360 

Glu Asp He Gly 
375 

His Leu Pro Thr 
390 

Gin Ser Phe His 
405 



Asp Glu Gly Asn 
250 

Lys Met Leu Phe 
265 

Phe His He Asp 



Ser Leu Ala Ala 
300 

Phe Tyr Leu Ser 
315 

Arg Arg Phe Gin 
330 

Gin Trp His Ser 
345 

He Phe Leu Thr 



Pro Gin Phe Pro 
380 

Ala Leu Ser Ser 
395 

Phe Asn Lys Thr 
410 



Ala Asn He Gin 
255 

Val Leu Val Leu 
270 

Arg Leu Phe Phe 
285 

Val Phe Asn Leu 



Ser Ala Val Asn 
320 

Ala Ala Phe Gin 
335 

Gin His Asp Pro 
350 

Glu Cys His Phe 
365 

Cys Gin Ser Ser 



Glu Gin Met Ser 
400 



<210> 5 
<211> 1658 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 



<222> (55) . . (1299) 

<223> IGS4B long version 



<400> 5 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 

Met 
1 

tea ggg atg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa 

Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 

5 10 15 

eta gaa gat cca ttc cag aaa cac ctg aac age aec gag gag tat ctg 

Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 

20 25 30 

gee ttc etc tgc gga cct egg cgc age cac ttc ttc etc ccc gtg tet 

Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser 

35 40 45 

gtg gtg tat gtg cca att ttt gtg gtg ggg gtc att gge aat gtc ctg 

Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu 

50 55 60 65 

9tg tgc ctg gtg att ctg cag cac cag get atg aag acg ccc aec aac 

Val Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn 

70 75 80 

tac tac etc ttc age ctg gcg gtc tct gae etc ctg gtc ctg etc ctt 

Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu 

85 90 95 

gga atg ccc ctg gag gtc tat gag atg tgg cgc aac tac cct ttc ttg 

Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu 

100 105 110 

ttc ggg ccc gtg gge tgc tac ttc aag acg gee etc ttt gag acc gtg 

Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 

115 120 125 

tgc ttc gee tec ate etc age ate aec ace gtc age gtg gag cgc tac 

Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr 

130 135 140 145 

gtg gcc ate eta cac eeg ttc cgc gee aaa ctg cag age aec egg cgc 

Val Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 

150 155 160 

egg gcc etc agg ate etc gge ate gtc tgg gge ttc tee gtg etc ttc 

Arg Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe 

165 170 175 

tee ctg ccc aac ace age ate eat gge ate aag ttc cac tac ttc ccc 

Ser Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro 

180 185 190 



aat ggg tec ctg gtc cca ggt teg gee aec tgt acg gtc ate aag ccc 
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Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro 
195 200 205 

atg tgg ate tac aat ttc ate ate cag gte ace tec ttc eta tte tae 729 
Met Trp lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe Tyr 
210 215 220 225 

etc etc cec atg aet gte ate agt gte ete tac tae etc atg gea etc 777 
Leu Leu Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala Leu 

230 235 240 

aga eta aag aaa gae aaa tet ctt gag gea gat gaa ggg aat gea aat 825 
Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn 
245 250 255 

att eaa aga cec tgc aga aaa tea gte aae aag atg etg ttt gte ttg 873 
lie Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu 
260 265 270 

gte tta gtg ttt get ate tgt tgg gee eeg ttc cac att gae ega ete 921 
Val Leu Val Phe Ala lie Cys Trp Ala Pro Phe His lie Asp Arg Leu 
275 280 285 

ttc tte age ttt gtg gag gag tgg act gaa tec etg get get gtg ttc 969 
Phe Phe Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val Phe 
290 295 300 305 

aae ete gte eat gtg gtg tea ggt gte tta ttc tae etg age tea get 1017 
Asn Leu Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser Ala 

310 315 320 

gte aae cec att ate tat aac eta etg tet egc cge ttc cag gea gea 1065 
Val Asn Pro lie lie Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala 
325 330 335 

ttc cag aat gtg ate tet tet ttc cac aaa cag tgg cac tec cag eat 1113 
Phe Gin Asn Val lie Ser Ser Phe His Lys Gin Trp His Ser Gin His 
340 345 350 

gae cca cag ttg cca cct gee cag egg aac ate ttc etg aca gaa tgc 1161 
Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn lie Phe Leu Thr Glu Cys 
355 360 365 

cac ttt gtg gag etg ace gaa gat ata ggt cec caa ttc eta tgt cag 1209 
His Phe Val Glu Leu Thr Glu Asp lie Gly Pro Gin Phe Leu Cys Gin 
370 375 380 385 

tea tec gtg cac aae tet cac etc cca aca gcc etc tet agt gaa cag 1257 
Ser Ser Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin 

390 395 400 

atg tea aga aca aac tat caa age ttc cac ttt aac aaa ace 12 99 

Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 415 



tgaattcttt cagagctgac tetectctat gcetcaaaae ttcagagagg aacatcccat 1359 



aatgtatgcc ttctcatatg aaattagaga ggtagaatgg ctcttacaac tcatgtaccc 1419 

attgctagtt tttttttttt aataaacgtg aaaactgaga gttagatctg gtttcaaaac 1479 

ccaagactgc ctgattttta gttatctttc cactatccta actgcctcat gccccttcac 1539 

tagttcatgc caagaacgtg actggaaagg catggcacct ataccttgat taatttccat 1599 

taatggaaat ggttcgtcct gagtcatcta cgttccgagt caggctgtca ctcctacta 1658 

<210> 6 

<211> 415 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin 
15 10 15 

Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
20 25 30 

Leu Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val 
35 40 45 

Ser Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val 
50 55 60 

Leu Val Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr 
65 70 75 80 

Asn Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 

85 90 95 

Leu Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe 
100 105 110 

Leu Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 
115 120 125 

Val Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg 
130 135 140 

Tyr Val Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg 
145 150 155 160 

Arg Arg Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu 

165 170 175 

Phe Ser Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe 
180 185 190 

Pro Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys 
195 200 205 

Pro Met Trp lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe 
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210 215 220 

Tyr Leu Leu Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala 
225 230 235 240 

Leu Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 

245 250 . 255 

Asn lie Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val 
260 265 270 

Leu Val Leu Val Phe Ala lie Cys Trp Ala Pro Phe His lie Asp Arg 
275 280 285 

Leu Phe Phe Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val 
290 295 300 

Phe Asn Leu Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser 
305 310 315 320 

Ala Val Asn Pro lie lie Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala 

325 330 335 

Ala Phe Gin Asn Val lie Ser Ser Phe His Lys Gin Trp His Ser Gin 
340 345 350 

His Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn lie Phe Leu Thr Glu 
355 360 365 

Cys His Phe Val Glu Leu Thr Glu Asp lie Gly Pro Gin Phe Leu Cys 
370 375 380 

Gin Ser Ser Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu 
385 390 395 400 

Gin Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 

405 410 415 



<210> 7 
<211> 1658 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (64) . . (1299) 

<223> IGS4B short version 

<400> 7 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat tttaatgtca 60 



ggg atg gaa aaa ctt cag aat get tec tgg ate tac eag cag aaa eta 108 
Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu 
15 10 15 
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gaa gat cca ttc cag aaa cac ctg aac age acc gag gag tat ctg gcc 156 

Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala 

20' 25 30 

ttc etc tgc gga cct egg cgc age cac ttc ttc etc cec gtg tct gtg 204 

Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val 

35 40 45 

gtg tat gtg cca att ttt gtg gtg ggg gtc att gge aat gte ctg gtg 252 

Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu Val 
50 55 60 

tgc ctg gtg att ctg cag cac cag get atg aag acg ccc ace aac tac 3 00 

Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr 
65 70 75 

tac etc ttc age ctg gcg gte tct gac etc ctg gtc ctg etc ett gga 348 

Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly 

80 85 90 95 

atg cec ctg gag gte tat gag atg tgg cgc aac tac cct ttc ttg ttc 396 

Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe 

100 105 110 

ggg cec gtg ggc tgc tac ttc aag acg gee etc ttt gag acc gtg tgc 444 

Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys 

115 120 125 

ttc gee tec ate etc age ate ace acc gte age gtg gag cgc tac gtg 492 

Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr Val 
130 135 140 

gcc ate eta cac ccg ttc cgc gee aaa ctg cag age ace egg cgc egg 540 

Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 

gcc etc agg ate etc ggc ate gtc tgg ggc ttc tee gtg etc ttc tec 588 

Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe Ser 

160 165 170 175 

ctg cec aac acc age ate cat gge ate aag ttc cac tac ttc cec aat 636 

Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro Asn 

180 185 190 

ggg tec ctg gtc cca ggt teg gee acc tgt acg gtc ate aag cec atg 684 

Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro Met 

195 200 205 

tgg ate tac aat ttc ate ate cag gtc acc tee ttc eta ttc tac etc 732 

Trp lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe Tyr Leu 
210 215 220 

etc cec atg act gtc ate agt gtc etc tac tac etc atg gca etc aga 780 

Leu Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 
225 230 235 

eta aag aaa gac aaa tct ett gag gca gat gaa ggg aat gca aat att 82 8 



Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn lie 
240 245 250 255 

caa aga ccc tgc aga aaa tea gtc aac aag atg ctg ttt gtc ttg gtc 876 
Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val 

260 265 270 

tta gtg ttt get ate tgt tgg gcc ccg ttc cac att gac ega etc tte 924 
Leu Val Phe Ala lie Cys Trp Ala Pro Phe His lie Asp Arg Leu Phe 
275 280 285 

ttc age ttt gtg gag gag tgg act gaa tec ctg get get gtg tte aac 972 
Phe Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val Phe Asn 
290 295 300 

etc gtc eat gtg gtg tea ggt gtc tta ttc tae ctg age tea get gtc 1020 
Leu Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser Ala Val 
305 310 315 

aac ccc att ate tat aac eta ctg tet cgc ege ttc cag gea gca tte 1068 
Asn Pro lie lie Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala Phe 
320 325 330 335 

cag aat gtg ate tct tet ttc cac aaa cag tgg cac tec cag cat gac 1116 
Gin Asn Val lie Ser Ser Phe His Lys Gin Trp His Ser Gin His Asp 

340 345 350 

cea cag ttg cea cet gee cag egg aac ate ttc ctg aca gaa tgc cac 1164 
Pro Gin Leu Pro Pro Ala Gin Arg Asn lie Phe Leu Thr Glu Cys His 
355 360 365 

ttt gtg gag ctg ace gaa gat ata ggt ccc caa tte eta tgt cag tea 1212 
Phe Val Glu Leu Thr Glu Asp lie Gly Pro Gin Phe Leu Cys Gin Ser 
370 375 380 

tec gtg cac aac tct cac etc cea aca gcc etc tct agt gaa cag atg 1260 
Ser Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin Met 
385 390 395 

tea aga aca aac tat caa age ttc cac ttt aac aaa acc tgaattcttt 1309 
Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
400 405 410 

cagagctgac tctcctctat gectcaaaac tteagagagg aacatcccat aatgtatgcc 1369 
tteteatatg aaattagaga ggtagaatgg ctettacaae tcatgtaccc attgctagtt 1429 
tttttttttt aataaacgtg aaaaetgaga gttagatctg gtttcaaaac ecaagactge 1489 
ctgattttta gttatetttc cactatecta aetgeetcat gceccttcae tagttcatge 1549 
caagaacgtg actggaaagg catggcacet ataccttgat taattteeat taatggaaat 1609 
ggttegteet gagteateta egttcegagt eaggetgtea eteetacta 1658 



<210> 8 
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<211> 412 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu Glu 
15 10 15 

Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala Phe 
20 25 30 

Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val Val 
35 40 45 

Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu Val Cys 
50 55 60 

Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr Tyr 
65 70 75 80 

Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly Met 

85 90 95 

Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe Gly 
100 105 110 

Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys Phe 
115 120 125 

Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr Val Ala 
130 135 140 

lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg Ala 
145 150 155 160 

Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe Ser Leu 

165 170 175 

Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro Asn Gly 
180 185 190 

Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro Met Trp 
195 200 205 

lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe Tyr Leu Leu 
210 215 220 

Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg Leu 
225 230 235 240 

Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn lie Gin 

245 250 255 

Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val Leu 
260 265 270 



Val Phe Ala lie Cys Trp Ala Pro Phe His lie Asp Arg Leu Phe Phe 



275 



280 



285 



Ser Phe Val Glu 
290 

Val His Val Val 
305 

Pr® lie lie Tyr 



Asn Val lie Ser 
340 

Gin Leu Pro Pro 
355 

Val Glu Leu Thr 
370 

Val His Asn Ser 
385 

Arg Thr Asn Tyr 

) 



Glu Trp Thr Glu 
295 

Ser Gly Val Leu 
310 

Asn Leu Leu Ser 
325 

Ser Phe His Lys 



Ala Gin Arg Asn 
360 

Glu Asp lie Gly 
375 

His Leu Pro Thr 
390 

Gin Ser Phe His 
405 



Ser Leu Ala Ala 
300 

Phe Tyr Leu Ser 
315 

Arg Arg Phe Gin 
330 

Gin Trp His Ser 
345 

lie Phe Leu Thr 



Pro Gin Phe Leu 
380 

Ala Leu Ser Ser 
395 

Phe Asn Lys Thr 
410 



Val Phe Asn Leu 



Ser Ala Val Asn 
320 

Ala Ala Phe Gin 
335 

Gin His Asp Pro 
350 

Glu Cys His Phe 

365 

Cys Gin Ser Ser 



Glu Gin Met Ser 
400 



<210> 9 

<211> 1594 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (55) . . (942) 

<22 3> IGS4A truncated DNA long version 

<400> 9 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 

Met 
1 

tea ggg atg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa 
Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 
5 10 15 

eta gaa gat cca ttc cag aaa cac ctg aac age ace gag gag tat ctg 
Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
20 25 30 

gee ttc etc tgc gga cct egg cge age cac ttc ttc etc cec gtg tet 
Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser 
35 40 45 



gtg gtg tat gtg cca att ttt gtg gtg ggg gtc att ggc aat gtc ctg 
Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu 
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50 55 60 65 

gtg tgc ctg gtg att ctg cag cac cag get atg aag acg ccc acc aac 297 

Val Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn 

70 75 80 

tac tac etc ttc age ctg gcg gtc tct gac etc ctg gte ctg etc ett 345 

Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu 

85 90 95 

gga atg ccc ctg gag gte tat gag atg tgg egc aac tac ect ttc ttg 393 

Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu 

100 105 110 

ttc ggg ccc gtg ggc tgc tac ttc aag acg gee etc ttt gag ace gtg 441 

Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 
115 120 125 

tgc ttc gee tec ate etc age ate acc acc gtc age gtg gag egc tac 489 

Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr 

130 135 140 145 

gtg gee ate eta cac eeg ttc egc gee aaa ctg cag age ace egg egc 537 

Val Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 

150 155 160 

egg gee etc agg ate etc ggc ate gte tgg ggc ttc tec gtg etc ttc 585 

Arg Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe 

165 170 175 

tee ctg ccc aac ace age ate cat ggc ate aag ttc cac tac ttc ccc 633 

Ser Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro 

180 185 190 

aat ggg tec ctg gtc cca ggt teg gee ace tgt acg gtc ate aag ccc 681 

Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro 
195 200 205 

atg tgg ate tac aat ttc ate ate cag gtc acc tec ttc eta ttc tac 729 

Met Trp lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe Tyr 

210 215 220 225 

etc etc ccc atg act gtc ate agt gtc etc tac tac etc atg gca etc 777 

Leu Leu Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala Leu 

230 235 240 

aga eta aag aaa gac aaa tct ett gag gca gat gaa ggg aat gca aat 825 

Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn 

245 250 255 

att caa aga ccc tgc aga aaa tea gtc aac aag atg ctg tct ttg tgg 873 

lie Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Ser Leu Trp 

260 265 270 

agg agt gga gtg aat ccc tgg ctg ctg tgt tea ace teg tec atg tgg 921 

Arg Ser Gly Val Asn Pro Trp Leu Leu Cys Ser Thr Ser Ser Met Trp 
275 280 285 



tgt cag gtg tct tct tct acc tgagctcagc tgtcaacccc attatctata 972 
Cys Gin Val Ser Ser Ser Thr 
290 295 

acctactgtc tcgccgcttc caggcagcat tccagaatgt gatctcttct ttccacaaac 1032 
agtggcactc ccagcatgac ccacagttgc cacctgccca gcggaacatc ttcctgacag 1092 
aatgccactt tgtggagctg accgaagata taggtcccca attcccatgt cagtcatcca 1152 
tgcacaactc tcacctccca acagccctct ctagtgaaca gatgtcaaga acaaactatc 1212 
aaagcttcca ctttaacaaa acctgaattc tttcagagct gactctcctc tatgcctcaa 1272 
aacttcagag aggaacatcc cataatgtat gccttctcat atgatattag agaggtagaa 1332 
tggctcttac aactcatgta cccattgcta gttttttttt tttaataaac gtgaaaactg 1392 
agagttagat ctggtttcaa aacccaagac tgcctgattt ttagttatct ttccactatc 1452 
ctaactgcct catgcccctt cactagttca tgccaagaac gtgactggaa aggcatggca 1512 
cctatacctt gattaatttc cattaatgga aatggttcgt cctgagtcat ctacgttccg 1572 
agtcaggctg tcactcctac ta 1594 



<210> 10 

<211> 296 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin 
15 10 15 

Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
20 25 30 

Leu Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val 
35 40 45 

Ser Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val 
50 55 60 

Leu Val Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr 
65 70 75 80 

Asn Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 

85 90 95 

Leu Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe 
100 105 110 

Leu Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 
115 120 125 



Val Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg 
130 135 140 



Tyr Val Ala lie 
145 

Arg Arg Ala Leu 



Phe Ser Leu Pro 
180 

Pro Asn Gly Ser 
195 

Pro Met Trp lie 
210 

Tyr Leu Leu Pro 
225 

Leu Arg Leu Lys 



Asn lie Gin Arg 
260 

Trp Arg Ser Gly 
275 

Trp Cys Gin Val 
290 



Leu His Pro Phe 
150 

Arg lie Leu Gly 
165 

Asn Thr Ser lie 



Leu Val Pro Gly 
200 

Tyr Asn Phe lie 
215 

Met Thr Val lie 
230 

Lys Asp Lys Ser 
245 

Pro Cys Arg Lys 



Val Asn Pro Trp 
280 

Ser Ser Ser Thr 
295 



Arg Ala Lys Leu 
155 

lie Val Trp Gly 
170 

His Gly lie Lys 
185 

Ser Ala Thr Cys 



lie Gin Val Thr 
220 

Ser Val Leu Tyr 
235 

Leu Glu Ala Asp 
250 

Ser Val Asn Lys 
265 

Leu Leu Cys Ser 



Gin Ser Thr Arg 
160 

Phe Ser Val Leu 
175 

Phe His Tyr Phe 
190 

Thr Val lie Lys 
205 

Ser Phe Leu Phe 



Tyr Leu Met Ala 
240 

Glu Gly Asn Ala 
255 

Met Leu Ser Leu 
270 

Thr Ser Ser Met 
285 



<210> 11 

<211> 1594 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (64) . . (942) 

<223> IGS4A truncated DNA short version 
<400> 11 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat tttaatgtca 

ggg atg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa eta 
Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu 
15 10 15 



gaa gat eca ttc cag aaa cae ctg aac age ace gag gag tat etg gee 
Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala 

20 25 30 
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ttc etc tgc gga cct egg cgc age cac ttc ttc ete ece gtg tct gtg 204 
Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val 
35 40 45 

gtg tat gtg cca att ttt gtg gtg ggg gtc att ggc aat gtc ctg gtg 252 
Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu Val 
50 55 60 

tgc ctg gtg att ctg cag cac cag get atg aag aeg ccc acc aac tac 300 
Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr 
65 70 75 

tac etc ttc age ctg gcg gtc tct gae etc ctg gtc ctg etc ctt gga 348 
Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly 
80 85 90 95 

atg ccc ctg gag gtc tat gag atg tgg cgc aac tac cct ttc ttg ttc 3 96 
Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe 

100 105 110 

ggg ccc gtg ggc tgc tac ttc aag aeg gee etc ttt gag acc gtg tgc 444 
Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys 
115 120 125 

ttc gee tec ate etc age ate ace acc gtc age gtg gag cgc tac gtg 492 
Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr Val 
130 135 140 

gee ate eta cac ceg ttc cgc gee aaa ctg cag age acc egg cgc egg 540 
Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 

gee etc agg ate etc ggc ate gtc tgg ggc ttc tee gtg etc ttc tec 588 
Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe Ser 
160 165 170 175 

ctg ccc aac ace age ate eat ggc ate aag ttc cac tac ttc ccc aat 636 
Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro Asn 

180 185 190 

ggg tee ctg gtc cca ggt teg gee ace tgt aeg gtc ate aag ccc atg 684 
Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro Met 
195 200 205 

tgg ate tac aat ttc ate ate cag gtc acc tee ttc eta ttc tac etc 732 
Trp lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe Tyr Leu 
210 215 220 

etc ccc atg act gtc ate agt gtc etc tac tac etc atg gea etc aga 780 
Leu Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 
225 230 235 

eta aag aaa gae aaa tct ctt gag gea gat gaa ggg aat gea aat att 828 
Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn lie 
240 245 250 255 

caa aga ece tgc aga aaa tea gtc aac aag atg ctg tct ttg tgg agg 876 
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Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Ser Leu Trp Arg 

260 265 270 

agt gga gtg aat ccc tgg ctg ctg tgt tea acc teg tec atg tgg tgt 924 
Ser Gly Val Asn Pro Trp Leu Leu Cys Ser Thr Ser Ser Met Trp Cys 
275 280 285 

eag gtg tct tct tct acc tgagctcagc tgtcaacccc attatctata 972 
Gin Val Ser Ser Ser Thr 
290 



acctactgtc 


tcgccgcttc 


caggcagcat 


tccagaatgt 


gatctcttct 


ttccacaaac 


1032 


agtggcactc 


ccagcatgac 


ccacagttgc 


cacctgccca 


gcggaacatc 


ttcctgacag 


1092 


aatgccactt 


tgtggagctg 


accgaagata 


taggteeeca 


attcccatgt 


cagtcatcca 


1152 


tgcacaactc 


tcacctccca 


acagccctct 


ctagtgaaca 


gatgtcaaga 


acaaactatc 


1212 


aaagcttcca 


ctttaacaaa 


acctgaatte 


ttteagagct 


gactctcctc 


tatgcctcaa 


1272 


aacttcagag 


aggaacatcc 


cataatgtat 


gccttctcat 


atgatattag 


agaggtagaa 


1332 


tggctcttac 


aactcatgta 


cecattgcta 


gttttttttt 


tttaataaac 


gtgaaaactg 


1392 


agagttagat 


ctggtttcaa 


aacccaagac 


tgcctgattt 


ttagttatct 


ttccactatc 


1452 


ctaactgcct 


eatgcccctt 


caetagttca 


tgccaagaac 


gtgactggaa 


aggcatggca 


1512 


cctatacctt 


gattaatttc 


cattaatgga 


aatggttcgt 


cctgagtcat 


ctacgttccg 


1572 


agtcaggctg 


tcactcctac 


ta 








1594 



<210> 12 
<211> 293 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu Glu 
15 10 15 

Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala Phe 
20 25 30 

Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val Val 
35 40 45 

Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu Val Cys 
50 55 60 

Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr Tyr 
65 70 75 80 

Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly Met 

85 90 95 



Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe Gly 
100 105 110 

Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys Phe 
115 120 125 

Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr Val Ala 
130 135 140 

lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg Ala 
145 150 155 160 

Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe Ser Leu 

165 170 175 

Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro Asn Gly 
180 185 190 

Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro Met Trp 
195 200 205 

lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe Tyr Leu Leu 
210 215 220 

Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg Leu 
225 230 235 240 

Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn lie Gin 

245 250 255 

Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Ser Leu Trp Arg Ser 
260 265 270 

Gly Val Asn Pro Trp Leu Leu Cys Ser Thr Ser Ser Met Trp Cys Gin 
275 280 285 

Val Ser Ser Ser Thr 
290 



<210> 13 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerated 
primers 

<220> 

<221> variation 

<222> (21) 

<223> A,C,G or T 



<220> 



<221> variation 

<222> (24) 

<223> A, C, G or T 

<400> 13 

ctcatcttcg cggtgggcrc ngyngg 



<210> 14 
<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerated 
primers 

<220> 

<221> variation 

<222> (22) 

<223> C or Inosine 

<220> 

<221> variation 

<222> (25) 

<223> A, C, G or T 

<220> 

<221> variation 

<222> (28) 

<223> A, C, G or T 

<400> 14 

ggccaggcag cgctccgcgc tnarncyngc d 

<210> 15 

<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Degenerated 
primers 

<400> 15 

gaartartag ccrcgrcagc cw 

<210> 16 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 16 

ccatcctaat acgactcact atagggc 



<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 17 

actcactata gggctcgagc ggc 



<210> 18 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 18 

ggatcccaaa taagaaaggg tagttgc 



<210> 19 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 19 

aaagggtagt tgcgccacat ctcatagac 



<210> 20 

<211> 29 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 20 

aggtctatga gatgtggcgc aactaccct 



<210> 21 
<211> 30 
<212> DNA 

<213> Artificial Sequence 




<220> 

<223> Description of Artificial Sequence: Primer 



<400> 21 

atgtggcgca actacccttt cttatttggg 



30 



<210> 22 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerated 
primers 

<400> 22 

cggaagttgg cggacacgrv rttrta 2 6 



<210> 23 
<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 24 

<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 24 

ccatgtggat ctacaatttc atcatcc 27 



<210> 25 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 



<400> 23 

gctcagcttg aaacagagcc tcgtacc 



27 



<400> 25 

aagacaaatc tcttgaggca gatgaaggg 



29 



<210> 26 
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<211> 30 
<212> DNA 

<213> Artificial Secfuence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 26 

gatgctgttt gtcttggtct tagtgtttgc 30 

<210> 27 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 28 
<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 28 

tgtggagaag tctctcaaag tgtgg 2 5 

<210> 29 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 30 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 27 

ggatgatgaa attgtagatc cacatgggc 



29 



<400> 29 

tagtaggagt gacagcctga ctcggaacg 



29 



<400> 30 

aacgtagatg actcaggacg aaccatttcc 



30 



<210> 31 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 31 

tcgtaccagg ggaggctcag gc 22 



<210> 32 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 32 

cctcttcagc ctggcggtct ctg 23 



<210> 33 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 33 

ggaggcgaag cacacggtct ca 22 



<210> 34 
<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<220> 

<221> misc__binding 

<222> (1) 

<223> Labeled with 6-carboxyf luorescein 
<220> 

<221> misc_binding 

<222> (34) 

<223> Labeled with 

N,N,N' ,N' -tetramethyl-6-carboxyrhodamin 



<400> 34 

agatgtggcg caactaccct ttcttgttcg ggcc 34 



woAi/7^9M PCT/EPOO/09584 

^'^^^^^ JC13 Rec'd PCT/PTO 2 2 MAR 2002 

1/28 

SEQUENCE LISTING 

<110> SOLVAY PHARMACEOTICALS B.V- 

<120> Novel human G-protein coupled Receptor 

<130> SPW99.06/HA 00.19 

<140> 
<141> 

<160> 34 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 1658 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (55) . . (1299) 

<223> IGS4A long version 

<400> 1 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 57 

Met 
1 

tea ggg atg gaa aaa ctt cag aat get tec tgg ate tac cag eag aaa 105 
Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 

5 10 15 

eta gaa gat cca tte cag aaa eac etg aac age aec gag gag tat ctg 153 
Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
20 25 30 

gcc tte etc tgc gga cot egg ege age eac tte tte etc cec gtg tet 201 
Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser 
35 40 45 

gtg gtg tat gtg eea att ttt gtg gtg ggg gtc att ggc aat gte ctg 249 
Val Val Tyr Val Pro lie Phe Val Val Gly Val He Gly Asn Val Leu 
50 55 60 65 

gtg tgc ctg gtg att ctg cag eac cag get atg aag acg cec ace aac 297 
Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn 

70 75 80 



tac tac etc tte' age ctg gcg gtc tet gae etc ctg gtc ctg etc ctt 
Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu 
85 90 95 



345 



gga atg cec ctg gag gtc tat gag atg tgg ege aac tac cet tte ttg 393 
Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu 



wo 01/25269 
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2/28 

100 105 110 

ttc ggg ccc gtg ggc tgc tac ttc aag acg gcc etc ttt gag acc gtg 441 
Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 
lis 12 0 125 

tgc ttc gcc tec ate etc age ate acc acc gtc age gtg gag cgc tac 489 
Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr 
130 135 140 145 

gtg gcc ate eta cac ccg ttc cgc gcc aaa etg eag age acc egg cgc 537 
Val Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 

150 155 160 

egg gcc etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc 585 
Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe 
165 170 175 

tec ctg ccc aac acc age ate cat ggc ate aag ttc cac tac ttc ccc 633 
Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro 
180 185 190 



aat ggg tec etg gtc cca ggt teg gee acc tgt acg gtc ate aag ccc 
Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro 
195 200 205 

atg tgg ate tac aat ttc ate ate eag gtc acc tec ttc eta ttc tac 
Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr 
210 215 220 225 

etc etc ccc atg act gtc ate agt gtc etc tac tac etc atg gea etc 
Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu 

230 235 240 

aga eta aag aaa gac aaa tet ett gag gca gat gaa ggg aat gea aat 
Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn 
245 250 255 

att caa aga ccc tgc aga aaa tea gtc aac aag atg ctg ttt gtc ttg 
He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu 
260 265 270 

gtc tta gtg ttt get ate tgt tgg gcc ccg ttc cac att gac ega etc 
Val Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu 
275 280 285 

ttc ttc age ttt gtg gag gag tgg agt gaa tec etg get get gtg ttc 
Phe Phe Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val Phe 
290 295 300 305 

aac etc gtc cat gtg gtg tea ggt gtc ttc ttc tac ctg age tea get 
Asn Leu Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser Ala 

310 315 320 



681 



729 



777 



825 



873 



921 



969 



1017 



gtc aac ccc att ate tat aac eta ctg tct cgc cgc ttc eag gca gca 
Val Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala 



1065 
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325 330 335 

ttc cag aat gtg ate tct tct ttc cac aaa cag tgg cac t<:c cag cat 1113 

Phe Gin Asn Val lie Ser Ser Phe His Lys Gin Trp His Ser Gin His 
340 345 350 

gac cca cag ttg cca cct gcc cag egg aac ate ttc etg aca gaa tgc 1161 

Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn lie Phe Leu Thr Glu Cys 

355 360 365 



cac ttt gtg gag ctg acc gaa gat ata ggt ecc caa ttc cca tgt cag 
His Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Pro Cys Gin 
370 375 380 385 



1209 



tea tec atg cac aac tct cac etc cca aca gcc etc tct agt gaa cag 1257 
Ser Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin 

390 395 400 

atg tea aga aca aac tat caa age ttc cac ttt aac aaa acc 12 99 

Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 415 

tgaattcttt cagagctgac tctcctctat gcctcaaaac ttcagagagg aacatcccat 1359 

aatgtatgce ttctcatatg atattagaga ggtagaatgg ctcttaeaac tcatgtacec 1419 

attgctagtt tttttttttt aataaacgtg aaaactgaga gttagatctg gtttcaaaac 1479 

ccaagactgc ctgattttta gttatctttc cactatccta actgcctcat gccccttcac 153 9 

tagttcatgc caagaaegtg actggaaagg eatggcacct ataccttgat taatttccat 1599 

taatggaaat ggttcgtcct gagtcatcta cgttccgagt caggctgtca ctcctacta 1658 



<210> 2 
<211> 415 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp He Tyr Gin Gin 
15 10 15 

Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
20 25 30 

Leu Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val 
35 40 45 

Ser Val Val Tyr Val Pro He Phe Val Val Gly Val He Gly Asn Val 
50 55 60 



Leu Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr 
65 70 75 80 
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Asn Tyr Tyx Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 

85 90 95 

Leu Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe 
100 105 110 

Leu Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 
115 120 125 

Val Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg 
130 135 140 

Tyr Val Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg 
145 150 155 160 

Arg Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu 

165 170 175 

Phe Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe 
180 185 190 

Pro Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys 
195 200 205 

Pro Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe 
210 215 220 

Tyr Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala 
225 230 235 240 

Leu Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 

245 250 255 

Asn He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val 
260 265 270 

Leu Val Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg 
275 280 285 

Leu Phe Phe Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val 
290 295 300 

Phe Asn Leu Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser 
305 310 315 320 

Ala Val Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala 

325 330 335 

Ala Phe Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin 
340 345 350 

His Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu 
355 360 365 



Cys His Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Pro Cys 
370 375 380 
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Gin Ser Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu 
385 390 395 400 

Gin Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 

405 410 415 



<210> 3 
<211> 1658 
<212> DNA 

<213> Homo sapiens ' 

<220> 

<221> CDS 

<222> (64) , . (1299) 

<223> IGS4A short version 

<400> 3 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat tttaatgtca 60 

ggg atg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa eta 108 
Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu 
15 10 15 

gaa gat cea tte eag aaa eac ctg aae age acc gag gag tat ctg gee 156 
Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala 

20 25 30 

ttc etc tgc gga cct egg cgc age eac tte tte etc cce gtg tct gtg 2 04 
Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val 
35 40 45 

gtg tat gtg cea att ttt gtg gtg ggg gte att ggc aat gtc ctg gtg 252 
Val Tyr Val Pro He Phe Val Val Gly Val He Gly Asn Val Leu Val 
50 55 60 

tgc ctg gtg att ctg cag eac cag get atg aag aeg cce ace aae tac 300 
Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr 
65 70 75 

tac etc ttc age ctg gcg gtc tct gae etc ctg gtc ctg etc ctt gga 34 8 
Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly 
80 85 90 95 

atg cce ctg gag gtc tat gag atg tgg cgc aae tac cct tte ttg ttc 3 96 
Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe 

100 105 110 

ggg cce gtg ggc tgc tac ttc aag aeg gee etc ttt gag acc gtg tgc 444 
Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys 
115 120 125 

ttc gee tec ate etc age ate acc acc gtc age gtg gag cgc tac gtg 492 
Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val 
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130 135 140 

gcc ate eta cac ccg ttc cgc gcc aaa ctg cag age ace egg egc egg 540 
Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 

gcc etc agg ate etc ggc ate gtc tgg ggc ttc tee gtg etc ttc tec 588 
Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser 
160 165 170 175 

ctg cec aac ace age ate cat ggc ate aag ttc cac tac ttc ecc aat 636 
Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro Asn 

180 185 190 

ggg tec ctg gtc cca ggt teg gcc ace tgt aeg gtc ate aag ecc atg 684 
Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro Met 
195 200 205 

tgg ate tac aat ttc ate ate cag gtc ace tec ttc eta ttc tac etc 732 
Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr Leu 
210 215 220 

etc ecc atg act gtc ate agt gtc etc tac tac etc atg gca etc aga 780 
Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 
225 230 235 

eta aag aaa gae aaa tet ctt gag gca gat gaa ggg aat gca aat att 828 
Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn He 
240 245 250 255 

caa aga cec tge aga aaa tea gtc aac aag atg ctg ttt gtc ttg gtc 876 
Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val 

260 265 270 

tta gtg ttt get ate tgt tgg gcc ccg ttc cac att gae ega etc ttc 924 
Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu Phe 
275 280 285 

ttc age ttt gtg gag gag tgg agt gaa tec ctg get get gtg ttc aac 972 
Phe Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala, Val Phe Asn 
290 295 300 

etc gtc cat gtg gtg tea ggt gtc ttc ttc tac ctg age tea get gtc 1020 
Leu Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser Ala Val 
305 310 315 

aac ecc att ate tat aac eta ctg tet cgc cgc ttc cag gca gca ttc 1068 
Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala Phe 
320 325 330 335 

cag aat gtg ate tet tet ttc cac aaa cag tgg cac tee cag eat gae 1116 
Gin Asn Val He Ser Ser Phe His Lys Gin Tzp His Ser Gin His Asp 

340 345 350 



cca cag ttg cca cct gcc cag egg aac ate ttc ctg aca gaa tge cac 
Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu Cys His 



1164 
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355 360 365 

ttt gtg gag ctg acc gaa gat ata ggt ccc caa ttc cca tgt cag tea 1212 
Phe Val Glu Leu Thr Glu Asp lie Gly Pro Gin Phe Pro Cys Gin Ser 
370 375 380 

tec atg cac aac tct cac etc cca aca gee etc tet agt gaa cag atg 1260 
Ser Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin Met 
385 390 395 

tea aga aca aae tat caa age ttc cac ttt aac aaa ace tgaattcttt 1309 
Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
400 405 410 

cagagctgac tetcctctat gectcaaaac ttcagagagg aacatcccat aatgtatgcc 1369 
ttctcatatg atattagaga ggtagaatgg ctettacaae teatgtaecc attgctagtt 1429 
tttttttttt aataaaegtg aaaactgaga gttagatctg gtttcaaaac ccaagactgc 1489 
etgattttta gttatctttc cactatccta actgcctcat gccccttcac tagttcatgc 1549 
caagaacgtg actggaaagg catggcacct ataccttgat taatttccat taatggaaat 1609 
ggttcgtcct gagtcatcta cgttccgagt caggctgtca ctcctacta 1658 



<210> 4 

<211> 412 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Glu Lys Leu Gin Asn Ala Ser Tzrp 
1 5 

Asp Pro Phe Gin Lys His Leu Asn Ser 
20 25 

Leu Cys Gly Pro Arg Arg Ser His Phe 
35 40 

Tyr Val Pro lie Phe Val Val Gly Val 
50 55 

Leu val lie Leu Gin His Gin Ala Met 
65 70 

Leu Phe Ser Leu Ala Val Ser Asp Leu 

85 

Pro Leu Glu Val Tyr Glu Met Trp Arg 
100 105 



lie Tyr Gin Gin Lys Leu Glu 
10 15 

Thr Glu Glu Tyr Leu Ala Phe 

30 

Phe Leu Pro Val Ser Val Val" 
45 

lie Gly Asn Val Leu Val Cys 
60 

Lys Thr Pro Thr Asn Tyr Tyr 
75 80 

Leu Val Leu Leu Leu Gly Met 
90 95 

Asn Tyr Pro Phe Leu Phe Gly 

110 



Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys Phe 
115 120 125 
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Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr Val Ala 
130 135 140 

lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg Ala 
145 150 155 160 

Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe Ser Leu 

165 170 175 

Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro Asn Gly 
180 185 190 

Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro Met Trp 
195 200 205 

He Tyr Asn Phe He lie Gin Val Thr Ser Phe Leu Phe Tyr Leu Leu 
210 215 220 

Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg Leu 
225 230 235 240 

Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn He Gin 

245 250 255 

Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val Leu 
260 265 270 

Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu Phe Phe 
275 280 285 

Ser Phe Val Glu Glu Trp Ser Glu Ser Leu Ala Ala Val Phe Asn Leu 
290 295 300 

Val His Val Val Ser Gly Val Phe Phe Tyr Leu Ser Ser Ala Val Asn 
305 310 315 320 

Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala Phe Gin 

325 330 335 

Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin His Asp Pro 
340 345 350 

Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu Cys His Phe 
355 360 365 

Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Pro Cys Gin Ser Ser 
370 375 380 

Met His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin Met Ser 
385 390 395 400 



Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 

405 410 
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<210> 5 

<211> 1658 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (55) . . (1299) 

<223> IGS4B long version 

<400> 5 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 57 

Met 
1 

tea ggg atg gaa aaa ctt cag aat get tec tgg ate tac eag cag aaa 105 
Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 

5 10 15 

eta gaa gat eea tte eag aaa cac etg aae age aee gag gag tat ctg 153 
Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
20 25 30 

gee tte etc tgc gga eet egg ege age eac tte tte etc ccc gtg tct 201 
Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser 
35 40 45 

gtg gtg tat gtg eea att ttt gtg gtg ggg gtc att ggc aat gtc ctg 249 
Val Val Tyr Val Pro lie Phe Val Val Gly Val He Gly Asn Val Leu 
50 55 60 65 

gtg tgc ctg gtg att ctg eag cac cag get atg aagacg ccc aee aac 297 
Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn 

70 75 80 

tac tac etc tte age ctg gcg gtc tct gae etc etg gtc ctg etc ctt 345 
Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu 
85 90 95 

gga atg ccc ctg gag gtc tat gag atg tgg cgc aac tac cot tte ttg 393 
Gly Met Pro Leu Glu Val Tyr Glu Mejt Trp Arg Asn Tyr Pro Phe Leu 
100 105 110 

tte ggg ccc gtg ggc tgc tac tte aag acg gee etc ttt gag ace gtg 441 
Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 
115 120 125 

tgc tte gee tec ate etc age ate ace ace gtc age gtg gag cgc tac 489 
Cys Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr 
130 135 140 145 

gtg gee ate eta cac ccg tte cgc gee aaa ctg cag age ace egg cgc 537 
Val Ala He Leu His Pro Phe Arg Jlla Lys Leu Gin Ser Thr Arg Arg 

150 155 160 
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egg gcc etc agg ate etc ggc ate gtc tgg ggc ttc tec gtg etc ttc 585 
Arg Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe 
165 170 175 

tec ctg ccc aac acc age ate cat ggc ate aag ttc cac tac ttc ecc 633 
Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro 
180 185 190 

aat ggg tec ctg gtc cca ggt teg gcc ace tgt acg gtc ate aag ccc 681 
Asn Gly Ser Lieu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro 
195 200 205 

atg tgg ate tac aat ttc ate ate cag gtc ace tec ttc eta ttc tac 729 
Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr 
210 215 220 225 

etc etc ccc atg act gtc ate agt gtc etc tac tac etc atg gea etc 777 
Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu 

230 235 240 

aga eta aag aaa gac aaa tct ctt gag gca gat gaa ggg aat gea aat 825 
Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn 
245 250 255 

att caa aga ccc tgc aga aaa tea gtc aac aag atg ctg ttt gtc ttg 873 
He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu 
260 265 270 

gtc tta gtg ttt get ate tgt tgg gcc ccg ttc cac att gac cga etc 921 
Val Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu 
275 280 285 

ttc ttc age ttt gtg gag gag tgg act gaa tec ctg get get gtg ttc 969 
Phe Phe Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val Phe 
290 295 300 305 

aac etc gtc cat gtg gtg tea ggt gtc tta ttc tac ctg age tea get 1017 
Asn Leu Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser Ala 

310 315 320 

gtc aac ccc att ate tat aac eta ctg tct cgc cgc ttc cag gca gca 1065 
Val Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala 
325 330 335 

ttc cag aat gtg ate tct tct ttc cac aaa cag tgg cac tec cag cat 1113 
Phe Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin His 
340 345 350 

gac cca cag ttg cca cct gcc cag egg aac ate ttc ctg aca gaa tgc 1161 
Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu Cys 
355 360 365 



cac ttt gtg gag ctg acc gaa gat ata ggt ccc caa ttc eta tgt cag 
His Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Leu Cys Gin 
370 375 380 385 



1209 
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tea tec gtg cac aac tct cac etc cea aca gee etc tet agt gaa eag 1257 
Ser Ser Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin 

390 395 400 



atg tea aga aca aac tat caa age ttc cac ttt aac aaa acc 1299 
Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
405 410 415 



tgaattcttt cagagctgac tctcctctat 
aatgtatgcc ttctcatatg aaattagaga 
attgctagtt tttttttttt aataaacgtg 
ccaagactgc ctgattttta gttatctttc 
tagttcatgc caagaacgtg actggaaagg 
taatggaaat ggttcgtcct gagtcatcta 



gcctcaaaac ttcagagagg aacatcccat 1359 
ggtagaatgg ctcttacaac tcatgtaccc 1419 
aaaactgaga gttagatctg gtttcaaaac 1479 
cactatccta actgcctcat gccccttcac 1539 
eatggcacct ataccttgat taatttccat 1599 
cgttccgagt caggctgtca ctcctacta 1658 



<210> 6 

<211> 415 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp He Tyr Gin Gin 
15 10 15 

Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
20 25 30 

Leu Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val 
35 40 45 

Ser Val Val Tyr Val Pro He Phe Val Val Gly Val He Gly Asn Val 
50 55 60 

Leu Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr 
65 70 75 80 

Asn Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 

85 90 95 

Leu Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe 
100 105 110 

Leu Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 
115 120 125 

Val Cys Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg 
130 135 140 



Tyr Val Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg 
145 150 155 160 
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Arg Arg Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu 

165 170 175 

Phe Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe 
180 185 190 

Pro Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys 
195 200 205 

Pro Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe 
210 215 220 

Tyr Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala 
225 230 235 240 

Leu Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 

245 250 255 

Asn He Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val 
260 265 270 

Leu Val Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg 
275 280 285 

Leu Phe Phe Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val 
290 295 300 

Phe Asn Leu Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser 
305 310 315 320 

Ala Val Asn Pro He He Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala 

325 330 335 

Ala Phe Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin 
340 345 350 

His Asp Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu 
355 360 365 

Cys His Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Leu Cys 
370 375 380 

Gin Ser Ser Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu 
385 390 395 400 

Gin Met Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 

405 410 415 



<210> 7 

<211> 1658 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> CDS 

<222> (64) . . (1299) 

<223> IGS4B short version 

<400> 7 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat tttaatgtca 60 

ggg atrg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa eta 108 
Met Glu Isys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu 
15 10 15 

gaa gat cca ttc cag aaa eac ctg aac age acc gag gag tat etg gee 156 
Glu Asp Pro Phe Gin Lys His Leu Asn Sex Thr Glu Glu Tyr Leu Ala 

20 25 30 



ttc etc tgc gga cct egg cgc age eac tte tte etc cee gtg tet gtg 204 
Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val 
35 40 45 

gtg tat gtg eca att ttt gtg gtg ggg gte att ggc aat gtc ctg gtg 252 
Val Tyr Val Pro lie Phe Val Val Gly Val He Gly Asn Val Leu Val 
50 55 60 

tgc ctg gtg att ctg cag cac cag get atg aag acg ccc acc aac tac 3 00 
Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr 
65 70 75 

tac etc ttc age ctg gcg gtc tct gac etc ctg gtc ctg etc ctt gga 348 
Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly 
80 85 90 95 



atg ccc ctg gag gtc tat gag atg tgg cgc aac tac cct ttc ttg ttc 
Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe 

100 105 110 



396 



ggg ccc gtg ggc tgc tac ttc aag acg gee etc ttt gag acc gtg tgc 444 
Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys 
115 120 125 

tte gee tee ate etc age ate ace aee gtc age gtg gag cgc tae gtg 4 92 
Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val 
130 135 140 



gee ate eta cae eeg ttc cgc gee aaa ctg eag age aee egg cgc egg 540 
Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 

gcc etc agg ate etc ggc ate gte tgg ggc ttc tec gtg etc tte tec 588 
Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser 
160 165 170 175 

ctg ccc aac aee age ate cat ggc ate aag tte eac tac ttc cee aat 636 
Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro Asn 

180 185 190 
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ggg tec ctg gtc cca ggt teg gcc acc tgt acg gte ate aag cce atg 

Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro Met 
195 200 205 



684 



tgg ate tac aat ttc ate ate cag gtc acc tec ttc eta tte tac etc 
Trp lie Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr Leu 
210 215 220 



732 



etc ccc atg act gtc ate agt gte etc tac tac etc atg gca etc aga 
Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 
225 230 235 



780 



eta aag aaa gae aaa tet ctt gag gca gat gaa ggg aat gca aat att 
Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn He 
240 245 250 255 



828 



caa aga ccc tgc aga aaa tea gtc aac aag atg ctg ttt gte ttg gte 
Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val 

260 265 270 



876 



tta gtg ttt get ate tgt tgg gee ecg ttc eac att gae ega etc tte 
Leu Val Phe Ala He Cys Trp Ala Pro Phe His He Asp Arg Leu Phe 
275 280 285 



924 



ttc age ttt gtg gag gag tgg act gaa tee ctg get get gtg tte aac 
Phe Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val Phe Asn 
290 295 300 



972 



etc gtc eat gtg gtg tea ggt gte tta tte tac ctg age tea get gte 
Leu Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser Ala Val 
305 310 315 



1020 



aac ccc att ate tat aac eta ctg tet cge ege ttc cag gca gca ttc 
Asn Pro He He Tyr Asn Leu Leu Ser Arg T^g Phe Gin Ala Ala Phe 
320 325 330 335 



1068 



cag aat gtg ate tet tet ttc cae aaa cag tgg eac tee cag cat gae 
Gin Asn Val He Ser Ser Phe His Lys Gin Trp His Ser Gin His Asp 

340 345 350 



1116 



cca cag ttg cca cct gee cag egg aac ate tte ctg aca gaa tgc eac 
Pro Gin Leu Pro Pro Ala Gin Arg Asn He Phe Leu Thr Glu Cys His 
355 360 365 



1164 



ttt gtg gag ctg ace gaa gat ata ggt ccc caa ttc eta tgt cag tea 
Phe Val Glu Leu Thr Glu Asp He Gly Pro Gin Phe Leu Cys Gin Ser 
370 375 380 



1212 



tec gtg cac aac tet eac etc cca aca gcc etc tet agt gaa cag atg 
Ser Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin Met 
385 390 395 



1260 



tea aga aca aac tat caa age ttc cac ttt aac aaa acc tgaattettt 
Ser Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 
400 405 410 



1309 
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cagagctgac tctcctctat gcctcaaaac ttcagagagg aacatcccat aatgtatgcc 1369 
ttctcatatg aaattagaga ggtagaatgg ctcttacaac tcatgtaccc attgctagtt 1429 
tttttttttt aataaacgtg aaaactgaga gttagatctg gtttcaaaac ccaagactgc 1489 
ctgattttta gttatctttc cactatccta actgcctcat gccccttcac tagttcatgc 1549 
caagaacgtg actggaaagg catggcacct ataccttgat taatttccat taatggaaat 1609 
ggttcgtcct: gagtcatcta cgttccgagt caggctgtca cticctacta 1658 



<210> 8 
<211> 412 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys Leu Glu 
15 10 15 

Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala Phe 
20 25 30 

Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val Val 
35 40 45 

Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu Val Cys 
50 55 60 

Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr Tyr 
65 70 75 80 

Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly Met 

85 90 95 

Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe Gly 
100 105 110 

Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys Phe 
115 120 125 

Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr Val Ala 
130 135 140 

lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg Ala 
145 150 155 160 

Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu Phe Ser Leu 

165 170 175 

Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro Asn Gly 
180 185 190 

Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys Pro Met Ta:p 
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195 200 205 

lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe Tyx Leu Leu 
210 215 220 

Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg Leu 
225 230 235 240 

Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn lie Gin 

245 250 255 

Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Phe Val Leu Val Leu 
260 265 270 

Val Phe Ala lie Cys Trp Ala Pro Phe His lie Asp Arg Leu Phe Phe 
275 280 285 

Ser Phe Val Glu Glu Trp Thr Glu Ser Leu Ala Ala Val Phe Asn Leu 
290 295 300 

Val His Val Val Ser Gly Val Leu Phe Tyr Leu Ser Ser Ala Val Asn 
305 310 315 320 

Pro lie lie Tyr Asn Leu Leu Ser Arg Arg Phe Gin Ala Ala Phe Gin 

325 330 335 

Asn Val lie Ser Ser Phe His Lys Gin Trp His Ser Gin His Asp Pro 
340 345 350 

Gin Leu Pro Pro Ala Gin Arg Asn lie Phe Leu Thr Glu Cys His Phe 
355 360 365 

Val Glu Leu Thr Glu Asp lie Gly Pro Gin Phe Leu Cys Gin Ser Ser 
370 375 380 

Val His Asn Ser His Leu Pro Thr Ala Leu Ser Ser Glu Gin Met Ser 
385 390 395 400 

Arg Thr Asn Tyr Gin Ser Phe His Phe Asn Lys Thr 

405 410 



<210> 9 
<211> 1594 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (55) . . (942) 

<223> 1GS4A truncated DNA long version 
<400> 9 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat ttta atg 57 

Met 
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tea S99 atg gaa aaa ctt cag aat get tec tgg ate tac eag cag aaa 105 
Ser Gly Met Glu Dys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 
,5 10 15 

eta gaa gat cca tte eag aaa eac ctg aac age ace gag gag tat etg 153 
Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
20 25 30 

gee ttc etc tgc gga cct egg cgc age eac tte tte etc cee gtg tet 201 
Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser 
35 40 45 

gtg gtg tat gtg cca att ttt gtg gtg ggg gtc att gge aat gte ctg 249 
Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val Leu 
50 55 60 65 

gtg tgc etg gtg att etg cag eac cag get atg aag acg cee ace aac 297 
Val Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn 

70 75 80 

tac tae etc ttc age etg geg gte tet gae etc etg gte etg etc ctt 345 
Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu 
85 90 95 

gga atg cee etg gag gtc tat gag atg tgg ege aac tac cct ttc ttg 393 
Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu 
100 105 110 

tte ggg ccc gtg ggc tgc tae ttc aag acg gee etc ttt gag ace gtg 441 
Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 
115 120 125 

tgc ttc gee tee ate etc age ate ace ace gtc age gtg gag cgc tac 489 
Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg Tyr 
130 135 140 145 

gtg gee ate eta eac ecg ttc cgc gee aaa etg eag age ace egg cgc 537 
Val Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 

150 155 160 

egg gee etc agg ate etc gge ate gte tgg gge ttc tee gtg etc ttc 585 
Arg Ala Leu Arg lie Leu Gly He Val Trp Gly Phe Ser Val Leu Phe 
165 170 175 

tee etg ccc aac ace age ate eat gge ate aag tte eac tac tte cee 633 
Ser Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro 
180 185 190 

aat ggg tee etg gte cca ggt teg gee ace tgt acg gtc ate aag ccc 681 
Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro 
195 200 205 



atg tgg ate tac aat tte ate ate eag gtc ace tee ttc eta ttc tac 
Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr 



729 
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210 215 220 225 

etc etc ccc atg act gtc ate agt gtc etc tac tac etc atg gca etc 777 
Leu Ijeu Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala Leu 

230 235 240 

aga eta aag aaa gac aaa tct ctt gag gca gat gaa ggg aat gca aat 825 
Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn 
245 250 255 

att caa aga ccc tgc aga aaa tea gtc aac aag atg ctg tct ttg tgg 873 
lie Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Ser Leu Trp 
260 265 270 

agg agt gga gtg aat ccc tgg ctg ctg tgt tea acc teg tec atg tgg 921 
Arg Ser Gly Val Asn Pro Trp Leu Leu Cys Ser Thr Ser Ser Met Trp 
275 280 285 

tgt cag gtg tct tct tct acc tgagetcagc tgtcaaecec attatetata 972 
Cys Gin Val Ser Ser Ser Thr 
290 295 

aectactgte tegcegctte caggcageat teeagaatgt gatctcttct ttccacaaac 1032 
agtggeaetc ccagcatgac ceacagttge eacctgceca gcggaacatc ttcetgacag 1092 
aatgeeactt tgtggagctg accgaagata taggtcccca atteccatgt eagtcatcca 1152 
tgcaeaacte tcacctccea acagccetct etagtgaaea gatgtcaaga acaaaetatc 1212 
aaagctteca ctttaacaaa acctgaattc tttcagaget gactetcetc tatgccteaa 1272 
aacttcagag aggaacatcc eataatgtat gccttctcat atgatattag agaggtagaa 1332 
tggctcttac aaetcatgta eceattgcta gttttttttt tttaataaac gtgaaaaetg 1392 
agagttagat ctggtttcaa aaeccaagac tgeetgattt ttagttatet ttccaetatc 1452 
etaactgcet catgcccctt eactagttca tgccaagaac gtgactggaa aggeatggca 1512 
cctatacctt gattaatttc cattaatgga aatggttegt cctgagtcat ctacgttccg 1572 
agtcaggetg tcactcetac ta 1594 



<210> 10 

<211> 296 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin 
15 10 15 

Lys Leu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr 
20 25 30 
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Leu Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val 
35 40 45 

Ser Val Val Tyr Val Pro lie Phe Val Val Gly Val lie Gly Asn Val 
50 55 60 

Leu Val Cys Leu Val lie Leu Gin His Gin Ala Met Lys Thr Pro Thr 
65 70 75 80 

Asn Tyr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu 

85 90 95 

Leu Gly Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe 
100 105 110 

Leu Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr 
115 120 125 

Val Cys Phe Ala Ser lie Leu Ser lie Thr Thr Val Ser Val Glu Arg 
130 135 140 

Tyr Val Ala lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Ikrg 
145 150 155 160 

Arg Arg Ala Leu Arg lie Leu Gly lie Val Trp Gly Phe Ser Val Leu 

165 170 175 

Phe Ser Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe 
180 185 190 

Pro Asn Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val lie Lys 
195 200 205 

Pro Met Trp lie Tyr Asn Phe lie lie Gin Val Thr Ser Phe Leu Phe 
210 215 220 

Tyr Leu Leu Pro Met Thr Val lie Ser Val Leu Tyr Tyr Leu Met Ala 
225 230 235 240 

Leu Arg Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala 

245 250 255 

Asn lie Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Ser Leu 
260 265 270 

Trp Arg Ser Gly Val Asn Pro Trp Leu Leu Cys Ser Thr Ser Ser Met 
275 280 285 

Trp Cys Gin Val Ser Ser Ser Thr 
290 295 



<210> 11 
<211> 1594 
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<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (64) . . (942) 

<223> IGS4A truncated DNA short version 
<400> 11 

ggctcagctt gaaacagagc ctcgtaccag gggaggctca ggccttggat tttaatgtca 60 

ggg atg gaa aaa ctt cag aat get tec tgg ate tac cag cag aaa eta 108 
Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys beu 
15 10 15 

gaa gat oca ttc cag aaa cac ctg aac age ace gag gag tat ctg gee 156 
Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala 

20 25 30 

ttc etc tge gga cct egg egc age cac ttc ttc etc cec gtg tct gtg 204 
Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val 
35 40 45 

gtg tat gtg cea att ttt gtg gtg ggg gtc att gge aat gtc ctg gtg 252 
Val Tyr Val Pro lie Phe Val Val Gly Val He Gly Asn Val Leu Val 
50 55 60 

tgc ctg gtg att ctg cag cac cag get atg aag aeg cec acc aac tac 300 
Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr 
65 70 75 

tac etc ttc age ctg gcg gtc tct gac etc ctg gtc ctg etc ctt gga 348 
Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly 
80 85 90 95 

atg cec ctg gag gtc tat gag atg tgg cgc aac tac cct ttc ttg ttc 396 
Met Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe 

100 105 110 



ggg cec gtg gge tge tac ttc aag aeg gee etc ttt gag ace gtg tgc 
Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys 
115 120 125 



444 



ttc gee tec ate etc age ate acc acc gtc age gtg gag egc tac gtg 4 92 
Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val 
130 135 140 

gee ate eta cac eeg ttc egc gee aaa ctg cag age ace egg cgc egg 540 
Ala He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg 
145 150 155 

gee etc agg ate etc gge ate gtc tgg gge ttc tec gtg etc ttc tec 588 
Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser 
160 165 170 175 



ctg cec aac ace age ate cat gge ate aag ttc cac tac ttc cec aat 



636 
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Leu Pro Asn Thr Ser lie His Gly lie Lys Phe His Tyr Phe Pro Asn 



180 



185 



190 



999 tec ctg gtc cca ggt teg gcc acc tgt acg gtc ate aag ccc atg 684 
Gly Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro Met 
195 200 205 

tgg ate tae aat ttc ate ate eag gte aec tec ttc eta ttc tac etc 732 
Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr Leu 
210 215 220 

etc ccc atg act gtc ate agt gtc etc tae tae etc atg gea etc aga 780 
Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg 
225 230 235 

eta aag aaa gac aaa tct ctt gag gca gat gaa ggg aat gea aat att 82 8 
Leu Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn He 
240 245 250 255 

caa aga ccc tge aga aaa tea gtc aac aag atg ctg tct ttg tgg agg 876 
Gin Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Ser Leu Trp Arg 

260 265 270 

agt gga gtg aat ccc tgg ctg ctg tgt tea acc teg tec atg tgg tgt 924 
Ser Gly Val Asn Pro Trp Leu Leu Cys Ser Thr Ser Ser Met Trp Cys 
275 280 285 

eag gtg tct tct tct acc tgageteage tgtcaaeece attatctata 972 
Gin Val Ser Ser Ser Thr 
290 

aeetaetgtc tegcegctte caggcagcat teeagaatgt gatctettet tteeaeaaac 1032 
agtggeactc ecagcatgac ccacagttgc cacctgecca gcggaacatc ttcctgacag 1092 
aatgeeaett tgtggagetg accgaagata taggtceeca attceeatgt cagteatcca 1152 
tgcacaaete teaectccea acagcectet etagtgaaca gatgtcaaga acaaactate 1212 
aaagcttcea etttaaeaaa acctgaatte tttcagaget gaeteteetc tatgecteaa 1272 
aactteagag aggaaeatee eataatgtat geettctcat atgatattag agaggtagaa 1332 
tggctettae aaetcatgta eccattgeta gttttttttt tttaataaac gtgaaaaetg 1392 
agagttagat etggttteaa aacceaagae tgeetgattt ttagttatct tteeactate 1452 
etaaetgcct catgeccctt cactagttea tgceaagaac gtgactggaa aggeatggea 1512 
cetataectt gattaatttc eattaatgga aatggttcgt ectgagteat ctaegttecg 1572 
agteaggetg teaeteetae ta 1594 



<210> 12 
<211> 293 
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<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyx Gin Gin Lys Leu Glu 
15 10 15 

Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu Ala Phe 
20 25 30 

Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser Val Val 
35 40 45 

Tyr Val Pro lie Phe Val Val Gly Val He Gly Pisn Val Leu Val Cys 
50 55 60 

Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Thr Asn Tyr Tyr 
65 70 75 80 

Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu Gly Met 

85 90 95 

Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu Phe Gly 
100 105 110 

Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val Cys Phe 
115 120 125 

Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr Val Ala 
130 135 140 

He Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg Arg Ala 
145 150 155 160 

Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe Ser Leu 

165 170 175 

Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro Asn Gly 
180 185 190 

Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro Met Trp 
195 200 205 

He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr Leu Leu 
210 215 220 

Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu Arg Leu 
225 230 235 240 

Lys Lys Asp Lys Ser Leu Glu Ala Asp Glu Gly Asn Ala Asn He Gin 

245 250 255 

Arg Pro Cys Arg Lys Ser Val Asn Lys Met Leu Ser Leu Trp Arg Ser 
260 265 270 



Gly Val Asn Pro Trp Leu Leu Cys Ser Thr Ser Ser Met Tirp Cys Gin 
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275 280 285 

Val Ser Ser Ser Thr 
290 



<210> 13 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Degenerated 
primers 

<220> 

<221> variation 

<222> (21) 

<223> A,C,G or T 

<220> 

<221> variation 

<222> (24) 

<223> A, C, G or T 

<400> 13 

ctcatcttcg cggtgggcrc ngyngg 26 



<210> 14 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerated 
primers 

<220> 

<221> variation 

<222> (22) 

<223> C or Inosine 

<220> 

<221> variation 

<222> (25) 

<223> A, C, G or T 

<220> 

<221> variation 

<222> (28) 

<223> A, C, G or T 



<400> 14 

ggccaggcag cgctccgcgc tnamcyngc d 



31 
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<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Degenerated 
primers 

<400> IS 

gaartartag ccrcgrcagc cw 



<210> 16 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Primer 
<400> 16 

ccatcctaat acgactcact atagggc 



<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 17 

actcactata gggctcgagc ggc 



<210> 18 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

ggatcccaaa taagaaaggg tagttgc 



<:210> 19 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 



<400> 19 

aaagggtagt tgcgccacat ctcatagac 



29 



<210> 20 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 20 

aggtctatga gatgtggcgc aactaccct 29 

<:210> 21 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 22 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerated 
primers 

<400> 22 

cggaagttgg cggacacgrv rttrta 26 

<210> 23 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 21 

atgtggcgca actacccttt cttatttggg 



30 



<400> 23 

gctcagcttg aaacagagcc tcgtacc 



27 
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<210> 24 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 24 

ccatgtggat ctacaatttc atcatcc 27 

<210> 25 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 26 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 26 

gatgctgttt gtcttggtct tagtgtttgc 30 



<210> 27 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sec[uence: Primer 



<400> 25 

aagacaaatc tcttgaggca gatgaaggg 



29 



<400> 27 

ggatgatgaa attgtagatc cacatgggc 



29 



<210> 28 
<211> 25 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 28 

tgtggagaag tctctcaaag tgtgg 



25 



<210> 29 
<211> 29 
<212> DNA 

<213> Art:i£icial Sec[uence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 29 

tagtaggagt gacagcctga ctcggaacg 2 9 



<210> 30 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 31 
<:211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 31 

tcgtaccagg ggaggctcag gc 22 



<210> 32 
<211> 23 
<212> DNA 

<213> Artificial Seq[uence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 30 

aacgtagatg actcaggacg aaccatttcc 



30 



<400> 32 

cctcttcagc ctggcggtct ctg 



23 



<210> 33 
<211> 22 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 33 

ggaggcgaag cacacggtct ca 



<210> 34 
<211> 34 
<:212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 
<220> 

<221> misc_binding 
<222> (1) 

<223> Labeled with 6-carboxyf luorescein 
<220> 

<221> misc_binding 

<222> (34) 

<:223> Labeled with 

N,N,N' ,N' -tetramethyl-6-carboxyrhodamin 



<400> 34 

agatgtggcg caactaccct ttcttgttcg ggcc 



34 



